Hopf algebras and Markov chains: Two examples and a theory 
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Abstract 

The operation of squaring (coproduct followed by product) in a combinatorial Hopf algebra 
is shown to induce a Markov chain in natural bases. Chains constructed in this way include 
widely studied methods of card shuffling, a natural "rock-breaking" process, and Markov chains 
on simplicial complexes. Many of these chains can be explictly diagonalized using the primitive 
elements of the algebra and the combinatorics of the free Lie algebra. For card shuffling, this 
gives an explicit description of the eigenvectors. For rock-breaking, an explicit description of 
the quasi-stationary distribution and sharp rates to absorption follow. 



1 Introduction 

A Hopf algebra is an algebra % with a coproduct A : % — > % <S> T~t which fits together with the 



product m : T-LgtH — > %. Background on Hopf algebras is in Section 2.2 The map raA :?{—>• % is 
called the Hopf-square (often denoted ^ 2 or x^). Our first discovery is that the coefficients of x^ 
in natural bases can often be interpreted as a Markov chain. Specializing to familiar Hopf algebras 
can give interesting Markov chains: the free associative algebra gives the Gilbert-Shannon-Reeds 
model of riffle shuffling. Symmetric functions give a rock-breaking model of Kolmogoroff [Kol41]. 
These two examples are developed first for motivation. 

Example 1.1 (Free associative algebra and riffle shuffling). Let x%, x%, . . . ,x n be noncommuting 
variables and H = k{x\, . . . , x n ) be the free associative algebra. Thus % consists of finite lin- 
ear combinations of words x^x^ ■ ■ ■ Xi k in the generators with the concatenation product. The 
coproduct A is an algebra map defined by A(xj) = 1 ® Xi + Xi (8) 1 and extended linearly. Consider 

A(ari 1 • • -Xi k ) = (1 (8) Xi x + xi Y <g> 1)(1 <g> Xi 2 + Xi 2 (g> 1) ■ ■ ■ (1 (g> Xi k + x ik <g) 1). 

A term in this product results from a choice of left or right from each factor. Equivalently, for each 
subset S C {1, 2, . . . , k}, there corresponds the term 



n**i® n 



Xi - . 
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Thus mA is a sum of 2 k terms resulting from removing {xi j }j^s an d moving them to the front. 
For example, 

mA(xix 2 x 3 ) = xix 2 x 3 + xix 2 x 3 + X2X1X3 + X3X1X2 + xix 2 x 3 + xix 3 x 2 + x 2 x 3 xi + xix 2 x 3 . 

Dividing mA by 2 k , the coefficient of a word on the right is exactly the chance that this word 
appears in a Gilbert-Shannon-Reeds inverse shuffle of a deck of cards labeled by x\ in initial 
order Xi x x,- l2 . . .X{ k . Applying ^mA in the dual algebra gives the usual model for riffle shuffling. 



Background on these models is in Section 5 As shown there, this connection between Hopf algebras 



and shuffling gives interesting new theorems about shuffling. 

Example 1.2 (Symmetric functions and rock-breaking). Let us begin with the rock-breaking 
description. Consider a rock of total mass n. Break it into two pieces according to the symmetric 
binomial distibution: 

-Pjleft piece has mass j} = j < j < n. 

Continue, at the next stage breaking each piece into — ji}, {j 2 -,n — j — j 2 } by independent 

binomial splits. The process continues until all pieces are of mass one when it stops. This description 
gives a Markov chain on partitions of n, absorbing at l n . 

This process arises from the Hopf-square map applied to the algebra A = A(xi, x%, . . . , x n ) of 
symmetric functions, in the basis of elementary symmetric functions e\. This is an algebra under 
the usual product. The coproduct, following [Gei77] is defined by 

A(ej) = eo <8> e.% + e\ ® H V a ® eo, 

extended multiplicatively and linearly. This gives a Hopf algebra structure on A which is a central 
object of study in algebraic combinatorics. It is discussed in |Section 2.4 Rescaling the basis 



elements to e, = z!ej, a direct computation shows that to A in the {e\} basis gives the rock-breaking 
process; see |Section 4.1 



A similar development works for any Hopf algebra which is either a polynomial algebra as an 
algebra (for instance, the algebra of symmetric functions, with generators e n ), or is cocommutative 
and a free associative algebra as an algebra (e.g., the free associative algebra), provided each object 
of degree greater than one can be broken non-trivially. These results are described in Theorem |3.4| 

Our second main discovery is that this class of Markov chains can be explicitly diagonalized using 
the Eulerian idempotent and some combinatorics of the free associative algebra. This combinatorics 



is reviewed in Section 2.3 It leads to a description of the left eigenvectors (Theorems 3.14 and 



3.15) which is often interpretable and allows exact and asymptotic answers to natural probability 



questions. For a polynomial algebra, we are also able to describe the right eigenvectors completely 



(Theorem 3.18). 



Example 1.3 (Shuffling). For a deck of n distinct cards, the eigenvalues of the Markov chain 
induced by repeated riffle shuffling are 1, 1/2, . . . , l/2 n_1 |Han90j . The multiplicity of the eigenvalue 
l/2 n_ * equals the number of permutations in S n with i cycles. For example, the second eigenvalue, 



1/2, has multiplicity Q)- For 1 < i < j < n, results from Section 5 show that a right eigenvector 
fij is given by 

1, if z and j are adjacent in w in order ij, 
— 1, if i and j are adjacent in uu in order ji, 
0, otherwise. 



fij(w) 
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Summing in i < j shows that d(w) — ^ip- is an eigenvector with eigenvalue 1 /2 (d(w) = # descents 
in w). Similarly p(w) — is an eigenvector with eigenvalue 1/4 (p(w) = # peaks in w). These 
eigenvectors are used to determine the mean and variance of the number of carries when large 
integers are added. 

Our results work for decks with repeated values allowing us to treat cases when e.g., the suits do 
not matter and all picture cards are equivalent to tens. Here, fewer shuffles are required to achieve 
stationarity. For decks of essentially any composition we show that all eigenvalues 1/2* , < i < 
n — 1, occur and determine multiplicities and eigenvectors. 



Example 1.4 (Rock-breaking). Consider the rock-breaking process of Example 1.2 started at (n), 
the partition with a single part of size n. This is absorbing at the partition l n . In Section 4} this 
process is shown to have eigenvalues 1, 1/2, . . . , l/2 n_1 with the multiplicity of l/2 n ~ l the number 
of partitions of n into I parts. Thus, the second eigenvalue is 1/2 taken on uniquely at the partition 
1™ _2 2. The corresponding eigenfunction is 



/i-2(A) = £ ( 2 ) 



This is a monotone function in the usual partial order on partitions and equals zero if and only if 
A = l n . If Xq = (n), Xi, X%, . . . are the successive partitions generated by the Markov chain then 

E( n ) (A"-2 2 (X fc )) = 2jfe/ln-2 2 (Xo) = {Z] j 2 k . 

Using Markov's inequality, 



k 



P{X k is not absorbed} < rM / 2 
This shows that for k = 21og 2 n + c, the chance of absorption is asymptotic to 1 — 1/2 C+1 when n 



is large. Section 4 derives all of the eigenvectors and gives further applications. 



Section 2 reviews Markov chains (including uses for eigenvectors), Hopf algebras, and some 



combinatorics of the free associative algebra. |Section "3 gives our basic theorems, generalizing the 



two examples to polynomial Hopf algebras and cocommutative, free associative Hopf algebras. 
|Section 4] treats rock-breaking; Section 5] treats shuffling. |Section 6 briefly describes other examples 



(e.g., graphs and simplicial complexes), counter-examples (e.g., the Steenrod algebra), and questions 
(e.g., quantum groups). 

Two historical notes: The material in the present paper has roots in work of Patras |Pat91| 
Pat93, Pat94], whose notation we are following, and Drinfeld [Dri89j. Patras studied shuffling in 
a purely geometric fashion, making a ring out of polytopes in IR n . This study led to natural Hopf 
structures, Eulerian idempotents, and generalization of Solomon's descent algebra in a Hopf context. 
His Eulerian idempotent maps decompose a graded commutative or cocommutative Hopf algebra 
into eigenspaces of the ath Hopf-powers; we improve upon this result, in the case of polynomial 
algebras or cocommutative, free associative algebras, by algorithmically producing a full eigenbasis. 
While there is no hint of probability in the work of Patras, it deserves to be much better known. 
More detailed references are given elsewhere in this paper. 

We first became aware of Drinfeld's ideas through their mention in Shnider-Sternberg [SS93]. 
Consider the Hopf-square, acting on a Hopf algebra %. Suppose that x G H is primitive, A(x) = 
1 (8) x + x (8) 1. Then mA(x) = 2x so x is an eigenvector of mA with eigenvalue 2. If x and y are 
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primitive then mA(xy + yx) = 4(xy + yx) and, similarly, if x\, . . . , x^ are primitive then the sum 
of symmetrized products is an eigenvector of mA with eigenvector 2 k . Drinfeld [Dri89, Prop 3.7] 
used these facts without comment in his proof that any formal deformation of the cocommutative 
universal enveloping algebra U{q) results already from deformation of the underlying Lie algebra 
0. See [SS93, Sect. 3.8] and Section 3.4 below for an expanded argument and discussion. For us, 
a description of the primitive elements and their products gives the eigenvectors of our various 



Markov chains. This is developed in Section 3 



Acknowledgements: We thank Marcelo Aguiar, Federico Ardila, Nantel Bergeron, Dan Bump, 
Gunner Carlsson, Ralph Cohen, Peter J. McNamara and Susan Montgomery for their help and 
suggestions. 



2 Background 

This section gives notation and background for Markov chains (including uses for eigenvectors), 
Hopf algebras, the combinatorics of the free associative algebra and symmetric functions. All of 
these are large subjects and pointers to accessible literature are provided. 

2.1 Markov chains 

Let X be a finite set. A Markov chain on X may be specified by a transition matrix K(x,y) 
(x,y G X) with K(x,y) > 0, ^2yK{x,y) = 1- This is interpreted as the chance that the chain 
moves from x to y in one step. If the chain is denoted Xq, X\, X2, ■ ■ . and Xq = x$ is a fixed 
starting state then 

n-l 

P{Xi = xi, ■ ■ ■ , X n = x n } = Y[ K(xi, x i+1 ). 

i=0 

Background and basic theory can be found in [KT75] or [Brc09]. The readable introduction 
[LPW09J is recommended as close in spirit to the present paper. The analytic theory is devel- 
oped in [SC97] . 

Let K 2 (x,y) = Yl z K{x, z)K{z,y) denote the probability of moving from x to y in two steps. 
Similarly, K l is defined. Under mild conditions [LPW091 Sec 1.5] Markov chains have unique 
stationary distributions ir(x): thus ir(x) > 0, Ylx n ( x ) = ^> ^(x)K(x, y) = 7t(y), so tt is a left 
eigenvector of K with eigenvalue 1. Set 

L 2 (Tr) = {f:X^R} with (/i|/ 2 )=^/i(x)/ 2 (xMx). 

Then K operates as a contraction on L 2 with Kf(x) = K(x,y)f(y). The Markov chains 
considered in this paper are usually not self-adjoint (equivalently reversible), nonetheless, they are 
diagonalizable over the rationals with eigenvalues 1 = /?o > Pi > • • ■ > P\x\— 1 > 0- We have a 
basis of left eigenfunctions {gi} i=0 with go(x) = tt(x) and gi(x)K(x,y) = Pigi{y), and, in 
some cases, a dual basis of right eigenfunctions {/i}l=o 1 with fo(x) = 1, Kfi(x) = fiifi{x), and 
~Yl<x fi( x )dj( x ) = A s i s customary in discussions of random walks on algebraic structures, we 
will abuse notation and think of the eigenfunctions fi both as functions on the state space and as 
linear combinations of the states - in other words, fi(x)x will also be denoted /j. 

Throughout, we are in the unusual position of knowing fy, g% and possibly fi explicitly. This is 
rare enough that some indication of the use of eigenfunctions is indicated. 
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Use A For any function / : X — )■ R, expressed in the basis of right eigenfunctions fi as 



1*1-1 

(2-1) / = E 

the expectation of / after k steps, having started at xq, is given by 

1*1-1 

(2-2) E xo (f(X k )) = £ aiPifM- 

i=0 

For example, for shuffling, the normalized number of descents d(7r) — (n — l)/2 is the sum of the 



1/2-eigenfunctions for riffle shuffling; see Example 5.8 Thus, with xq = id and all k, < k < oo 



77 — 1 / 1 
Eld (d{X k )) = — [l-- k 

In [DF09b[ IDF09a] it is shown that the number of descents in repeated riffle shuffles has the 
same distribution as the number of carries when n integers are added. Further, the square of this 
eigenfunction has a simple eigenfunction expansion leading to simple formulae for the variance and 
covariance of the number of carries. 

Use B If / is a right eigenfunction with eigenvalue /3, then the self-correlation after k steps 
(starting in stationarity) is 

E 7V {f(X )f(X k )} = E 7r {E{f(X )f(X k )\X = x }} = p k E^ {/(X 2 )} . 

This indicates how certain correlations fall off and gives an interpretation of the eigenvalues. 

Use C For / a right eigenfunction with eigenvalue /3, let Yi = f (Xi) / '/3 i , < i < oo. Then Yj 
is an Ti martingale with T% = u (Xq, X±, . . . , Xi). One may try to use optional stopping, maximal 
and concentration inequalities and the martingale central limit theorem to study the behavior of 
the original Xi chain. 



Use D The left eigenfunctions come into computations since 9i(x)fj(x) = 5ij. Thus in (2.1), 
a i = (di\f 7 71 ")- (Here f /ir is just the density of / with respect to 7r.) 

Use E A second prevalent use of left eigenfunctions throughout this paper: the dual of a Hopf 
algebra is a Hopf algebra and left eigenfunctions of the dual chain correspond to right eigenfunctions 
of the original chain. This is similar to the situation for time reversal. If K*(x,y) = ^^K(y,x) 
is the time-reversed chain (note K*(x,y) is a Markov chain with stationary distribution ir), then 
gij-K is a right eigenfunction of K* . 

Use F The left eigenfunctions also come into determining the quasi-stationary distribution of 
absorbing chains such as the rock-breaking chain. A useful, brief introduction to quasi-stationarity 
is in |KT75j . The comprehensive survey |vD91] and annotated bibliography jPolllj are also useful. 
Consider the case where there is a unique absorbing state x, and the second eigenvalue j3\ of the 
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chain satisfies 1 = 0q > fix > fa > . . . . This holds for rock-breaking. There are two standard 
notions of "the limiting distribution of the chain given that it has not been absorbed" : 



(2.3a) 
(2.3b) 



vr 1 ^) 
vr 2 (x) 



lim P{X k 

k— >oo 



lim lim P{X k 

fc— i-OO I— >oo 



x\X k / x.}; 

x\Xi ^ x,}. 



In words, 7r 1 (a;) is the limiting distribution of the chain given that it has not been absorbed up to 
time k and 7T 2 (x) is the limiting distribution of the chain given that it is never absorbed. These 
quasi-stationary distributions can be expressed in terms of the eigenfunctions: 



(2.4) 



ttUx) 



9i{x) 



tt 2 (x) 



ai{x)fi{x) 



These results follow from simple linear algebra and are proved in the references above. For rock- 
breaking, results in Section 4 show that 7T 1 = it 2 is point mass at the partition 21 n ~ 2 . 



Use G Both sets of eigenfunctions appear in the formula 
(2.5) 



1*1-1 

K\x,y)= (3 l ifi(x) 9l (y). 

i=0 



This permits the possibility of determining convergence rates. It can be difficult to do for chains 
with large state spaces. See the examples and discussion in |DF12| . 

To conclude this discussion of Markov chains we mention that convergence is customarily mea- 
sured by a few standard distances: 



(2.6) Total variation 

(2.7) Separation 

(2.8) Sup 



\K l Xo - HItv = max \K l X(j (A) - n(A)\ = - ^ \K l (x ,y) - n(y)\ 

y 

K l (x ,y) 



sep Xo (I) = max 1 
loo(l) = max 



"' 7r(y) 

K l (x ,y) - n(y) 



Here \\K l XQ — vt||tv < se Px o (0 — fco(0 and all distances are computable by determining the maxi- 
mizing or minimizing values of A or y and using (2.5)-(2.8). See [LPW091 Lemma 6.13] for further 
discussion of these distances. 



2.2 Hopf algebras 

A Hopf algebra is an algebra T~L over a field k (usually the real numbers in the present paper). It 
is associative with unit 1, but not necessarily commutative. Let us write m for the multiplication 
in 7~L, so m(x (g) y) = xy. Then : Ti® a — > % will denote a-fold products (so m = mi^), formally 
777, W = m{i ® m' a_1 ^) where l denotes the identity map. 

H comes equipped with a coproduct A : % — > written A(x) = J2( x ) x (i)® x (2) h 1 Sweedler 

notation [Swe69] . The coproduct is coassociative in that 

(A ® t)A(x) = ^ ® x (1)(2) ® x (2 ) = ^ x(i) <8> x (2 ) (1) ® x (2 ) (2) = (t ® A)A(z) 

(x),(x (1) ) (?Ux m ) 
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so there is no ambiguity in writing A[ 3 ](x) = Yl( x ) ® x (2) ® x (3)- Similarly, A^ : % — > 
denotes the a-fold coproduct, where A is applied a — 1 times, to any one tensor-factor at each stage; 
formally A^ = (i (g) • ■ ■ <g> i ® A) A' 61-1 ] . The Hopf algebra % is cocommutative if Yl( x ) ® x (2) = 
X)(a;) x (2) ® in other words, an expression in Sweedler notation is unchanged when the indices 
permute. An element x of % is primitive if A{x) = 1 (g) a; + x (g> 1. 

The product and coproduct have to be compatible so A is an algebra homomorphism, where 
multiplication on is componentwise; in Sweedler notation this says A(xy) = Yl(x) (y) x (i)y(i)® 

X (2)U(2)- All of the algebras considered here are graded and connected, i.e., H = ©°^ %j with 
Tio = k and % n finite-dimensional. The product and coproduct must respect the grading so 
7-LiHj C Hi+j, and x G 7i n implies A(x) G ®"_ ?{j £3 %n-j- There are a few more axioms 
concerning a counit map and an antipode (automatic in the graded case); for the present paper, 
the most important is that the counit is zero on elements of positive degree, so, by the coalgebra 
axioms, A(x) := A(x) — 1® x — x ®1 G ©?=i Hj <8> Hn-j, for x G 7i n . The free associative 



algebra and the algebra of symmetric functions, discussed in Section 1 , are examples of graded 
Hopf algebras. 

The subject begins in topology when H. Hopf realized that the presence of the coproduct leads 
to nice classification theorems which allowed him to compute the cohomology of the classical groups 
in a unified manner. Topological aspects are still a basic topic [Hat02j with many examples which 
may provide grist for the present mill. For example, the cohomology groups of the loops on a 
topological space form a Hopf algebra, and the homology of the loops on the suspension of a wedge 
of circles forms a Hopf algebra isomorphic to the free associative algebra of Example |1.1| [BS53 . 

Joni and Rota [JR79J realized that many combinatorial objects have a natural breaking structure 
which gives a coalgebra structure to the graded vector space on such objects. Often there is a 
compatible way of putting pieces together, extending this to a Hopf algebra structure. Often, either 
the assembling or the breaking process is symmetric, leading to commutative or cocommutative 
Hopf algebras respectively. For example, the symmetric function algebra is commutative and 
cocommutative while the free associative algebra is just cocommutative. 

The theory developed here is for graded commutative or cocommutative Hopf algebras with 
one extra condition, that there is a unique way to assemble any given collection of objects. This 
amounts to the requirement that the Hopf algebra is either a polynomial algebra as an algebra 
(and therefore commutative) or a free associative algebra as an algebra and cocommutative (and 
therefore noncommutative). (We write a free associate algebra to refer to the algebra structure 
only, as opposed to the free associative algebra which has a specified coalgebra structure - namely, 
the generating elements are primitive.) 

Increasingly sophisticated developments of combinatorial Hopf algebras are described by [ABS06, 
Sch93, Sch94"l [Sch87[ ISch 95] and [AM10]. This last is an expansive extension which unifies many 
common examples. Below are two examples that are prototypes for their Bosonic Fock functor and 
Full Fock functor constructions respectively [AM10, Ch. 15]; they are also typical of constructions 
detailed in other sources. 

Example 2.1 (The Hopf algebra of unlabeled graphs). |Sch941 Sec. 12] [FislCH Sec. 3.2] Let Q be 
the vector space spanned by unlabeled simple graphs (no loops or multiple edges). This becomes a 
Hopf algebra with product disjoint union and coproduct 

A{G) = Y,Gs®G s c 

where the sum is over subsets of vertices S with Gs, G s c the induced subgraphs. Graded by 
number of vertices, Q is both commutative and cocommutative, and is a polynomial algebra as an 



algebra. The associated random walk is described in Example 3.1 below. 
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Example 2.2 (The noncommutative Hopf algebra of labeled graphs). }Sch941 Sec. 13] |FislO| 
Sec. 3.3] Let Q be the vector space spanned by the set of simple graphs with vertices are labeled 
{1, 2, . . . , n}, for some n. The product of two graphs G1G2 is their disjoint union, where the vertices 
of G\ keep their labels, and the labels in G2 are increased by the number of vertices in G\. The 
coproduct is 

A(G) =^G s ®G s c 

where we again sum over all subsets S of vertices of G, and Gs, G s c are relabeled so the vertices 
in each keep the same relative order. For example, 

A(Z.) = Z..i * 

\2 3 ' 2 3 2 



/ 



.1 



.1 
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where 1 denotes the empty graph. £/ is noncommutative and cocommutative and a free associative 



algebra as an algebra; the associated random walk is detailed in Example 3.2 As the notation 



suggests, Q is a quotient of Q, obtained by forgetting the labels on the vertices. 

Aguiar-Bergeron-Sottile [ABS06J define a combinatorial Hopf algebra as a Hopf algebra T-L 
with a character ( : % — > k which is both additive and multiplicative. They prove a universality 
theorem: any combinatorial Hopf algebra has a unique character-preserving Hopf morphism into the 
algebra of quasisymmetric functions. They show that this unifies many ways of building generating 
functions. When applied to the Hopf algebra of graphs, their map gives the chromatic polynomial. 



In Section 3.7 we find that their map gives the probability of absorption for several of our Markov 



chains. See also the examples in Section 6 



A good introduction to Hopf algebras is in |SS93] . A useful standard reference is in [Mon93j and 
our development does not use much outside of her Chapter 1. The broad-ranging text by |Maj95| 
is aimed towards quantum groups but contains many examples useful here. Quantum groups are 



neither commutative nor cocommutative and need special treatment; see Example 6.3 

A key ingredient in our work is the Hopf-square map ^f 2 = mA; ^ 2 (x) is also written x^ 2 h 
In Sweedler notation, ^> 2 {x) = Yl( x ) x (i) x (2)'-> in our combinatorial setting, it is useful to think 
of "pulling apart" x according to A, then using the product to put the pieces together. On 
graded Hopf algebras, V? 2 preserves the grading and, appropriately normalized, gives a Markov 



chain on appropriate bases. See Section 3.2 for assumptions and details. The higher power maps 
yj/a _ m la]^[a] w j-q a j SQ ^ e gtudied, since under our hypothesis, they present no extra difficulty. For 
example, *& 3 (x) = Yl(x) x (i) x {2) x (S)- in the shuffling example, ty a corresponds to the "a-shuffles" of 
[BD92j. A theorem of [TO70J shows that, for commutative or cocommutative Hopf algebras, the 
power rule holds: (x^)^ = x^ ah \ or ^i a ^ b = ^> ab . See also the discussion in |LMS06] . In shuffling 
language this becomes "an a-shufffe followed by a 6-shuffle is an afe-shuffle" [BD92J. In general Hopf 
algebras this power law often fails [KasOO] . Power maps are actively studied as part of a program 
to carry over to Hopf algebras some of the rich theory of groups. See [G M09| ILMOO] and their 
references. 
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2.3 Structure theory of a free associative algebra 

The eigenvectors of our Markov chains are described using combinatorics related to the free asso- 
ciative algebra, as described in the self-contained |Lot97l Chap. 5]. 

A word in an ordered alphabet is Lyndon if it is strictly smaller (in lexicographic order) than 
its cyclic rearrangements. So 1122 is Lyndon but 21 or 1212 are not. A basic fact |Lot97| Th. 5.1.5] 
is that any word w has a unique Lyndon factorization, that is, w = life ■ ■ ■ h with each Zj a Lyndon 
word and l\ > I2 > • • • > Ik- Further, each Lyndon word I has a standard factorization: if I is not a 
single letter, then I = I1I2 where l{ is non-trivial Lyndon and I2 is the longest right Lyndon factor 
of I. (The standard factorization of a letter is just that letter by definition.) Thus 13245 = 13 • 245. 
Using this, define, for Lyndon I, its standard bracketing recursively by X(a) = a for a letter 
and X(l) = [A(/i), A(^)] for I = lifa in standard factorization. As usual, [x,y] = xy — yx for words 
x, y. Thus 

A(13245) = [A(13), A(245)] = [[1, 3], [2, [4, 5]]] 

= 13245 - 13254 - 13452 + 13542 - 31245 + 31254 + 31452 - 31542 
- 24513 + 25413 + 45213 - 54213 + 24531 - 25431 - 45231 + 54231 

and 

A(1122) = [1,A(122)] = [1,[A(12),2]] 

= 1122 - 2(1212) + 2(2121) - 2211. 

[GR89j Sec. 2] describes how to visualize the standard bracketing of a Lyndon word as a rooted 
binary tree: given a Lyndon word I with standard factorization I = I1I2, inductively set 7} to be 
the tree with Ti x as its left branch and 7] 2 as its right branch. T13245 and T1122 are shown below. 



1 3 2 4 5 1 1 2 2 




T 13245 T 1122 

Observe that a word w appears in the expansion of A(Z) only if, after exchanging the left and 
right branches at some vertices of 2], the leaves of TJ, when read from left to right, spell out w. 
The coefficient of w in is then the signed number of ways to do this (the sign is the parity of 
the number of exchanges required). For example, 

• 25413 has coefficient 1 in A(13245) since the unique way to rearrange T13245 so the leaves spell 
25413 is to exchange the branches at the root and the highest interior vertex; 
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• 21345 does not appear in A(13245) since whenever the branches of T13245 switch, 2 must 
appear adjacent to either 4 or 5, which does not hold for 21345; 

• 1221 has coefficient in A(1122) as, to make the leaves of T1122 spell 1221, we can either 
exchange branches at the root, or exchange branches at both of the other interior vertices. 
These two rearrangements have opposite signs, so the signed count of rearrangments is 0. 

A final piece of notation is the following symmetrized product: let w = life ■ ■ - Ik in Lyndon 
factorization. Then set 

sym(u;) = ^ A(/ <t(1) )A(/ <t(2 )) • • • A(/ CT ( fc) ). 

aes k 



Viewing sym(u>) as a polynomial in the letters w\, W2, ■ ■ ■ , wi will be useful for Theorem 3.15 



Garsia's tree construction can be extended to visualize sym(w), using what Barcelo and Berg- 
eron [BB90| call decreasing Lyndon hedgerows, which simply consist of Ti x , TJ 2 , . . . , T[ k placed in a 
row. Denote this as T w also. The example T35142 is shown below. 




T 35142 



We can again express the coefficient of w' in sym(u>) as the signed number of ways to rearrange 
T w so the leaves spell w'. Now there are two types of allowed moves: exchanging the left and right 
branches at a vertex, and permuting the trees of the hedgerow. The latter move does not come 
with a sign. Thus 14253 has coefficient —1 in sym(35142), as the unique rearrangement of T35142 
which spells 14253 requires transposing the trees and permuting the branches labeled 3 and 5. 

It is clear from this pictorial description that every term appearing in sym(u>) is a permutation 
of the letters in w. [GR89J Th. 5.2] shows that sym(u') form a basis for a free associative algebra. 
This will turn out to be a left eigenbasis for inverse riffle shuffling, and similar theorems hold for 
other Hopf algebras. 

2.4 Symmetric functions and beyond 

A basic object of study is the vector space A^ of homogeneous symmetric polynomials in k 
variables of degree n. The direct sum A^ = tfe f° rms a graded algebra with familiar 

bases: the monomial (m\), elementary (e\), homogeneous (h\), and power sums (p\). For ex- 
ample, e 2 (xi,-- - ,Xk) = Yli<i<j<k x i x j an d f° r a partition A = Ai > A2 > • • • > A; > with 
Ai + • • • + A; = n, e\ = e\ x e\ 2 ■ ■ ■ e\ r As A ranges over partitions of n, {e\} form a basis for A£, 
from which we construct the rock-breaking chain of Example |1.2| Splendid accounts of symmetric 
function theory appear in [Mac95j and [Sta99j. A variety of Hopf algebra techniques are woven 



10 



into these topics, as emphasized by |Gei77] and [Z el81| . The comprehensive account of noncom- 
mutative symmetric functions [GKL + 95] and its follow-ups furthers the deep connection between 
combinatorics and Hopf algebras. However, this paper will only involve its dual, the algebra of 
quasisymmetric functions, as they encode informations about absorption rates of our chains, see 



Section 3.7 A basis of this algebra is given by the monomial quasisymmetric functions: for a 
composition a = (a\, . . . , a^), define M a = X^i<i 2 <-<i x t X ' ' ' X T ■ Further details are in (5ta99, 
Sec. 7.19]. 

3 Theory 

3.1 Introduction 



This section states and proves our main theorems. This introduction sets out definitions. Section 3.2 
develops the reweighting schemes needed to have the Hopf-square maps give rise to Markov chains. 
Section 3.3 explains that these chains are often acyclic. Section 3.4| addresses a symmetrization 



lemma that we will use in Section 3.5 and Section 3.6 to find descriptions of some left and right 



eigenvectors respectively for such chains. |Section 3.7 determines the stationary distributions and 



gives expressions for the chance of absorption in terms of generalized chromatic polynomials. Ap- 
plications of these theorems are in the last three sections of this paper. 



As mentioned at the end of Section 2.2 , we will be concerned with connected, graded (by positive 
integers) Hopf algebras T~L with a distinguished basis B satisfying one of two "freeness" conditions 
(in both cases, the number of generators may be finite or infinite): 

1. % = R [ci, C2, . . . ] as an algebra (ie H is a polynomial algebra) and B = {c™ 1 ^ 2 . . . |n, £ N}, 
the basis of monomials. The q may have any degree, and there is no constraint on the 
coalgebra structure. This will give rise to a Markov chain on combinatorial objects where 
assembling is symmetric and deterministic. 

2. Ti is cocommutative, H = K (c±, c%, . . . ) as an algebra, (ie T~L is a free associative algebra) and 
B = {ci 1 Ci 2 . . . \ij £ N}, the basis of words. The Cj may have any degree, and do not need to 
be primitive. This will give rise to a Markov chain on combinatorial objects where pulling 
apart is symmetric, assembling is non-symmetric and deterministic. 

By the Cartier-Milnor-Moore theorem |MM65l (^ar07| . any graded connected commutative Hopf 
algebra has a basis which satisfies the first condition. However, we will not make use of this, since 
the two conditions above are reasonable properties for many combinatorial Hopf algebras and their 
canonical bases. For example, the Hopf algebra of symmetric functions, with the basis of elementary 
symmetric functions e\, satisfies the first condition. 

Write Tin for the subspace of degree n in T~L, and B n for the degree n basis elements. The 
generators Ci can be identified as those basis elements which are not the non-trivial product of 
basis elements; in other words, generators cannot be obtained by assembling objects of lower degree. 
Thus, all basis elements of degree one are generators, but there are usually generators of higher 



degree; see Examples 3.1 and 3.2 below. One can view the conditions 1 and 2 above as requiring the 
basis elements to have unique factorization into generators, allowing the convenient view of b E B as 
a word b = c\Ci • • • q. Its length 1(b) is then well-defined - it is the number of generators one needs 



to assemble together to produce b. Some properties of the length are developed in |Section For 
a noncommutative Hopf algebra, it is useful to choose a linear order on the set of generators refining 
the ordering by degree: i.e. if deg(c) < deg(c'), then c < c'. This allows the construction of the 
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Lyndon factorization and standard bracketing of a basis element, as in Section 2.3 Example 3.16 
demonstrates such calculations. 

The ath Hopf-power map is 1 f a := m^A'"!, the a-fold coproduct followed by the a-fold prod- 
uct. These power maps are the central object of study of |Pat91} IPat93j IFat94j . Intuitively, *f? a 
corresponds to breaking an object into a pieces (some possibly empty) in all possible ways and then 
reassembling them. The ^ a preserve degree, thus mapping % n to H n . 

As noted in |Pat93| , the power-map *S> a is an algebra homomorphism if % is commutative: 

ty a (xy) = m [a] x (l)y(i)®---® x (a)y(a) 
(*),(v) 

= ■■■ X (a)y(a) = Y X (l)--- X (a)y(l)---y(a)=^ a (x)^ a (y); 

( x ),(y) 0),(j/) 
and a coalgebra homomorphism if % is cocommutative: 

® m a ) (Ax) = J2^ a ® * a ( x (2)) 
(*) 

= Yj ' • • X ( a ) ® ^C^ 1 ) • • • X ( 2a ) = ^2 X W X (3) • • • X {2a-1) ® X {2) X {A) ■ ■ ■ x (2a) = 
(x) (x) 

Only the former will be necessary for the rest of this section. 
3.2 The Markov chain connection 

The power maps can sometimes be interpreted as a natural Markov chain on the basis elements B n 
o£Hn. 



Example 3.1 (The Hopf algebra of unlabeled graphs, continuing from Example 2.1). The set of 



all unlabeled simple graphs gives rise to a Hopf algebra Q with disjoint union as product and 

A(G) = Y,G s ®G s c 

where the sum is over subsets of vertices S with Gs, G s c the induced subgraphs. Graded by the 
size of the vertex set, Q is a commutative and cocommutative polynomial Hopf algebra with basis 
B consisting of all graphs. The generators are precisely the connected graphs, and the length of a 
graph is its number of connected components. 

The resulting Markov chain on graphs with n vertices evolves as follows: from G, color the 
vertices of G red or blue, independently with probability 1/2. Erase any edge with opposite colored 
vertices. This gives one step of the chain; the process terminates when there are no edges. The 
analogous Hopf algebra of simplicial complexes is discussed in |Section 6 



Example 3.2 (The noncommutative Hopf algebra of labeled graphs, continuing from Example 2.2 ). 
Let Q be the linear span of the simple graphs whose vertices are labeled {1,2, ... ,n} for some n. 
The product of two graphs G1G2 is their disjoint union, where the vertices of G\ keep their labels, 
and the labels in G2 are increased by the number of vertices in G\. The coproduct is 



&(G) = Y,G s ®G s c 



where the sum again runs over all subsets S of vertices of G, and Gs, G s c are relabeled so 
the vertices in each keep the same relative order. An example of a coproduct calculation is in 
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Example 2.2 Q is cocommutative and a free associative algebra; its distinguished basis B is the 



set of all graphs. A graph in Q is a product if and only if there is an i such that no edge connects 
a vertex with label < i to a vertex with label > i. Thus, all connected graphs are generators, but 
there are non-connected generators such as 



1 



Each step of the associated random walk on B n , the graphs with n vertices, has this description: 
from G, color the vertices of G red or blue, independently with probability 1/2. Suppose r vertices 
received the color red; now erase any edge with opposite colored vertices, and relabel so the red 
vertices are 1, 2, . . . , r and the blue vertices are r + 1, r + 2, . . . , n, keeping their relative orders. For 
example, starting at the complete graph on three vertices, the chain reaches each of the graphs 
shown below with probability 1/8: 





So, forgetting the colors of the vertices, 





I 2 3 



with probability 1/4 
witli probability 3/8 



with probability 3/8 



l 2 3 



As with Q, the chain on Q n stops when all edges have been removed. 

When is such a probabilistic interpretation possible? To begin, the coefficients of mA(6) must 
be non-negative real numbers for b G B. This usually holds for combinatorial Hopf algebras, 
but the free associative algebra and the above algebras of graphs have an additional desirable 
property: for any b G B, the coefficients of ^ 2 (6) sum to 2 des ^ b \ regardless of b. Thus the operator 
2^r v I /2 (6) = ^2 b , K(b, b')b' forms a Markov transition matrix on basis elements of degree n. Indeed, 
the coefficients of *$> a (b) sum to a deg ^ for all a, so ^^ a (b) = K a (b,b')b' defines a transition 
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matrix K a . For other Hopf algebras, the sum of the coefficients in ^> 2 (b) may depend on b, so 
simply scaling \E' 2 does not always yield a transition matrix. 

[Zho08f Lemma 4.4.1.1] provides a solution: if K is a matrix with non-negative entries and <j> is 
a strictly positive right eigenfunction of K with eigenvalue 1, then K(b,b') := 4>{b)~ l K(b,b')4>{b') 
is a transition matrix. Here K is the conjugate of K by the diagonal matrix whose entries are 



Theorem 3.4 below gives conditions for such <jj to exist, and constructs explicitly. The following 




example explains why this construction is natural. 

Example 3.3 (Symmetric functions and rock-breaking). Consider the algebra of symmetric func- 
tions with basis {e A }, the elementary symmetric functions. The length l{e\) is the number of parts 
in the partition A, and the generators are the partitions with a single part. The coproduct is defined 
by 

i 

A(ej) = e j ® e i-j so 

j=0 

A(e A ) = A(e Al ) • • • A(e A; ) = (j^ ^ ® ^i-fcl ' ' 

\h=o / 

= ^2 ey <8> e X -\> 
\'<x 

with the sum over all compositions A' = A' l5 A' 2 , . . . , A^ with < A^ < Aj, and A— A' is the composition 
Ai — Aj, . . . , Xi — Aj. When reordered, some parts may be empty and some parts may occur several 
times. There are (Ai + 1) • • • (A/ + 1) possible choices of A', so the coefficients of \£ 2 (e A ) sum to 
(Ai + 1) • • • (A; + 1), which depends on A. 

Consider degree 2, where the basis elements are and e2- For K such that ^^ 2 (6) = 
Zb>K(b,b')b', 

K- 1 ° 

1 I ' 

4 2 

which is not a transition matrix as the second row does not sum to 1. Resolve this by performing 
a diagonal change of basis: set = (^(e^ )~ 1 e 1 2, e>2 = §{z2)~^&2 for some non-negative function 
(j) : B ->• R, and consider K with ^ 2 (b) = J2~ b > ^( fo > b ') h ' ■ 

Since the first row of K, corresponding to 
e^, pose no problems, set (/'(e^) = 1. In view of the upcoming theorem, it is better to think of this 
as 4>(ei2) = (4>(ei)) 2 with 4>{e\) = 1. Equivalently, = e 2 with e\ = e\. Turning attention to the 
second row, observe that A(e=2) = 4>(e2)~ 1 (e2® 1 + ei ® ei + 1(8) e%), so \£ 2 (e2) = e2 + </>(e2) _1 e 1 2 +£2, 
which means 

K- 1 ° 

" ^(e 2 )- 1 I ' 

so K is a transition matrix if \4>( e 2)~ 1 + \ = 1, i-e. if ^(e2) = 3- 

Continue to degree 3, where the basis elements are e±3, e\2 and e^. Now define K such that 
2 k* 2 (b) = ZbiK(b,b')b'; 

1 



K= \ i 
1 



1 

4 4 



Again, look for (^(e^), ^(ei2) and 4>{ez) so that ET, defined by ^^ 2 {b) = K (b, b )b , is a 
transition matrix, where = (/>(e 1 3) _1 e 1 3, £12 = <A(ei2) _1 ei2, £3 = (/>(e3) _1 e3. Note that, tak- 
ing 4>(ei'i) = (0(ei)) 3 = 1 and 0(ei2) = (ft(e2)(f)(ei) = \, the first two rows of K sum to 1. 
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View this as = ef and e\i = e?,&\- Then, as ^(e^) = (j)(e$) 1 (e% + e^ex + ^x^i + 63) 
7^(e 3 ) _1 e2,i + \i 



63 + \4>{ez) 1 e2,i + 2<K e 3) ^2,1 + e3, the transition matrix is given by 



1 

k= hi 

Uies)- 1 \ 



and choosing 4>(e^) = g makes the third row sum to 1. 

Continuing, we find that </>(ej) = 4, so e« = i!ej, more generally, c»a = II(*D ^ e A with i 
appearing aj(A) times in A. Then, for example, 



mA (e n ) = ti!mA(e n ) = nlm^e, (8) e n _j = ^ ( ™ )ej<2 n _j. 



i=0 i=0 

So, for any partition A of n, 

mA (ex) = mA (e Al ) • • • mA (e Xn ) 



A'<A 



and the coefficients of toA(<5a) sum to J2\><\ (y|) " " " (y) = 2 Al • • • 2 An = 2 n , irrespective of A. Thus 
imA describes a transition matrix, which has the rock-breaking interpretation of 



Section 1 



The following theorem shows that this algorithm works in many cases. Observe that, in the 
above example, it is the non-zero off-diagonal entries that change; the diagonal entries cannot be 
changed by rescaling the basis. Hence the algorithm would fail if some row had all off-diagonal 
entries equal to 0, and diagonal entry not equal to 1. This corresponds to the existence of b G B n 
with 27T^ /2 (6) = ab for some the condition A(c) := A(c) - l®c-c®1^0 below precisely 

prevents this. Intuitively, we are requiring that each generator of degree greater than one can be 



broken non-trivially. For an example where this condition fails, see Example 6.5 



Theorem 3.4 (Basis rescaling). Let % be a graded Hopf algebra over R which is either a polynomial 
algebra or a free associative algebra that is cocommutative. Let B denote the basis of monomials in 
the generators. Suppose that, for all generators c with deg(c) > 1, all coefficients of A(c) (in the 
B®B basis) are non-negative and A(c) 7^ 0. Let K a be the transpose of the matrix of a~ n ^ a with 
respect to the basis B n ; in other words, a~ n ^ a (b) = Y2,b' K a {b,b')b' (suppressing the dependence of 
K a on n). Define, by induction on degree, 



c = c if deg(c) = 1 

I — 2 1_de g( c ) 
d= E b ^ c <P(b)K2(c,b) C 
b = c\ . . . q for b € B with factorization into generators c\. . . q 



for a generator c with deg(c) > 1 



where <j)(b) satisfies b = <p(b)b. Write B := {b\b G B} and B n := {b\b E B n }. Then the matrix 
of the ath power map with respect to the B n basis, when transposed and multiplied by a~ n , is a 
transition matrix. Ln other words, the operator K a on H n , defined by a~ n ^ a (b) = Y2b' Ka(b, b )b = 
£ 6 , ^(6)~ 1 K(6, b')(j)(b')b' , has K a (b, V) > and K a (b, V) = 1 for all b G B n , and all a > and 
n > (the same scaling works simultaneously for all a). 
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Remarks. 1. Observe that, if b = xy, then the definition of b ensures b = xy. Equivalently, <fi is 
a multiplicative function. 

2. The definition of c is not circular: since % is graded with Hq = R, the counit is zero on 
elements of positive degree so that A(c) G ©J S ! (c) 1 n j ®n dLeg{c) - j . Hence X 2 (c,6) is non- 
zero only if 6 = c or 1(b) > 1, so the denominator in the expression for c only involves 0(6) 
for b with > 1. Such b can be factorized as b = xy with deg(x), deg(y) < deg(fe), whence 
4>(b) = 4>(x)4>(y), so c only depends on 4>(x) with deg(x) < deg(c). 

Proo/. First note that K 2 (c,c) = ^(c)- 1 K 2 (c, c)<j)(c) = K 2 (c,c) = 2 1 ~ dc s( c ), since mA(c) = 2c + 
mA(c) and A(c) G 0^1^ 1 "Hj <8> %dcg(c)-j means no c terms can occur in mA(c). So 

££ 2 (c,6') = 2 1 - dc ^) + 0( c )-i^ K 2 (c,V)<Kl/) 

V b'+c 

1 _ 2 1— deg(c) 

= 1, 

as desired. 

Let r]c V denote the coefficients of A(c) in the B <g> B basis, so A(c) = ^ x r/c^x <8> y. Then 
K 2 (c,6) = 2- dc s( c )E a;j/=b ^, and 

k 2 ( c ,6) = 2- dc §( c ) £ ^(cy^y^b) 

xy=b 
xy=b 

So, if b has factorization into generators b = c\ . . . ci , then 
A(6) = A( Ci )...A(q) 

lc\ ■ ■ • 'lei 



£ //,'•'' ...//;' i in ...in- 



xi,...,xi,yi,...,yi 



SO 



Thus 



K 2 (b, b') = 2- dc ^ £ ^ci V1 ■ ■ ■ < m • 

x 1 ...xiy 1 ...y i =b' 



££2(6,6') = 2- dc ^)£^)- 1 K 2 (6,6O0(6') 



2 - deg(6) £ 0(6)" 1 < 1 « 1 . . . . . . X lVl ... Vl ) 



xi,...,xi,yi,...,yi 

I 

= n 2 " dcs(c ° £ ttcir^WxiMvi) 

i=l 

= nE^(c,6) 

i=l 6' 
= 1 
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as desired, where the third equality is due to multiplicativity of <j>. 

The above showed each row of K2 sums to 1, which means (l,l,...,l)isa right eigenvector of 
K2 of eigenvalue 1. K a describes ^> a in the B basis , wh ich is also a basis of monomials/ words, in a 



rescaled set of generators c, so, by Theorems 



3.18 



and 



3.19 



the eigenspaces of K a do not depend 



on a. Hence (1, 1, ... , 1) is a right eigenvector of K a of eigenvalue 1 for all a, thus each row of K a 
sums to 1 also. 

Finally, to see that the entries of K a are non- negative, first extend the notation r]c V so A^ a \c) = 
E 6l ,...b a ^'""'Hl®- • •»&«. As AM = (t<g>- ••<8)t(8)A)A[ - 1 ], it follows that r ?c bl -" 6a = ^ x r l h c 1 '-' ba - 2 ' x r 1 b x a 
which inductively shows that rjc 1, '" ,ba > for all generators c and all bi E i3. So, if b has factorization 
into generators b = c\ . . . c\ , then 



> fe l,<x 



>o, 



where the sum is over all sets {bij} i=1 J=1 such that the product 61,162,1 
b'. Finally, K a (b,b') = ct>(b)- 1 K a (b,b ; )cb(b r ) > 0. 



■bub 



1,2 • 



bin 



□ 



3.3 Acyclicity 



Observe that the rock-breaking chain (Examples 1.2 and 3.3) is acyclic - it can never return to a 



state it has left, because the only way to leave a state is to break the rocks into more pieces. More 
specifically, at each step the chain either stays at the same partition or moves to a partition which 
refines the current state; as refinement of partitions is a partial order, the chain cannot return to a 



state it has left. The same is true for the chain on unlabeled graphs (Example 3.1 ) - the number of 
connected components increases over time, and the chain never returns to a previous state. Such 
behavior can be explained by the way the length changes under the product and coproduct. (Recall 
that the length 1(b) is the number of factors in the unique factorization of b into generators.) Define 
a relation on B by b — > b' if b' appears in ty a (b) for some a. If *$> a induces a Markov chain on B n , 
then this precisely says that b' is accessible from b. 

Lemma 3.5. Let b,bi,b^ be monomials/words in a Hopf algebra which is either a polynomial 
algebra or a free associative algebra that is cocommutative. Then 

(1) l(b 1 ...b a )=l(b l ) + --- + l(b a ); 

(ii) For any summand bn^ (g> • • • (g> bu\ in A^(b), I (6(1)) +••• + / (&( a )) — Kb)', 

(iii) ifb->V, then l(b') >l(b). 



Proof, (i) is clear from the definition of length. 

Prove (ii) by induction on 1(b). Note that the claim is vacuously true if b is a generator, as 
each I > 0, and not all I (&(,)) may be zero. If b factorizes non-trivially as b = xy, then, as 

A^(b) = A^(x) AM (y) , it must be the case that br^ = xi^yu^ for some a^m <8) ■ • -®x^ in A^(x), 

2/(1) ® ■ ■ ■ ® I/(o) in A [a] (y). So I +••• + / (6 (o) ) = I {x {1) ) +••• + / (x (a) ) + / +■■■ + ! (y (a) ) 
by (i), and by inductive hypothesis, this is at least l(x) +l(y) = 1(b). 

(iii) follows trivially from (i) and (ii): if b — > b' , then b' = bru ■ ■ ■ br a -\ for a term b^ • • • <8> 6( a ) 
in AW(6). So l(b') = I (b {1) ) +■■■ + I (b (a) ) >l(b). □ 

If T~L is a polynomial algebra, more is true. The following proposition explains why chains built 
from polynomial algebras (i.e., with deterministic and symmetric assembling) are always acyclic; 
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in probability language, it says that, if the current state is built from I generators, then, with 
probability a l ~ n , the chain stays at this state, otherwise, it moves to a state built from more 
generators. Hence, if the states are totally ordered to refine the partial ordering by length, then 
the transition matrices are upper-triangular with a l ~ n on the main diagonal. 

Proposition 3.6. (Acyclicity) Let % be a Hopf algebra which is a polynomial algebra as an algebra, 
and B its monomial basis. Then the relation — > defines a partial order on B, and the ordering by 
length refines this order: ifb—>b' and b ^ b' , then 1(b) < l(b'). Furthermore, for any integer a and 
any b £ B with length 1(b), 

^> a (b) = a m b + a bb' b ' 

l(b')>l(b) 

for some aw . 

Proof. It is easier to first prove the expression for ^> a (b). Suppose b has factorization into gen- 
erators b = c\C2 ■ ■ ■ Chi,) ■ As H is commutative, *$> a is an algebra homomorphism, so *$> a (b) = 

VP ( Cl )...^ (c l(b) ). 



H 



Recall from 
in other words, 1 (g) c and c 



Section 2.2 



that A(c) = A(c) - 1 <g> c - c ® 1 e ®Zi Hi ® 
deg(c)—ii m otner woras, i o$ c ana c 1 are the only terms in A(c) which have a tensor-factor 
of degree 0. As A® = (l (g A) A, the only terms in Al 3 l(c) with two tensor-factors of degree are 
1 (g) 1 (g) c, 1 ® c (g) 1 and c (g> 1 (X 1 1. Inductively, we see that the only terms in Al a l(c) with all but 
one tensor-factor having degree are 1 ® • • • <g> 1 ® c, l®---<g>l<g>c®l,...,c<g>l®---<g>l. So 
ty a (c) = flC +Ej(j')>i ctcb'b' for generators c. As fy a (b) = ^f a (ci) ...^ r ° (q), and length is multiplica- 
tive (Lemma |3.5| (i)), the expression for ^> a (b) follows. 

It is then clear that —> is reflexive and antisymmetric. Transitivity follows from the power rule: 
If b —7- b' and b' — > b" , then b' appears in ty a (b) for some a and b" appears in ^ a (b 1 ) for some a! . 
So b" appears in y a '^ a (b) = ^ a ' a (b). □ 

The same argument applied to a cocommutative free associative algebra shows that all terms 
in fy a (b) are either a permutation of the factors of b, or have length greater than that of b. The 
relation — > is only a preorder; the associated chains are not acyclic, as they may oscillate between 
such permutations of factors. For example, in the noncommutative Hopf algebra of labeled graphs, 
the following transition probabilities can occur: 




1 2 3 



(the bottom state is absorbing) . The probability of going from b to some permutation of its factors 
(as opposed to to a state of greater length, from which there is no return to b) is a l ^~ n . 



Here is one more result in this spirit, necessary in Section 3.5 to show that the eigenvectors 



constructed there have good triangularity properties and hence form an eigenbasis: 

Lemma 3.7. Let b, 6j, b[ be monomials /words in a Hopf algebra which is either a polynomial algebra 
or a free associative algebra that is cocommutative. If b = b\ . . .bk and bi — > b\ for each i, then 
b^b' a{l) ...b' a{k) for anyaeS k . 
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Proof. For readability, take k = 2 and write b = xy, x — > x', y — > y'. By definition of the relation 
— it must be that x' = xm ■ ■ ■ xi a \ for some summand xry\ ® ■ ■ ■ ® X( a \ of A^(x). Likewise 
y' = y<i\ . . . yi a <) for some a! . Suppose a > a! . Coassociativity implies that A^ a \y) = (t (g) • • • ® i ® 
A[ a_a '])A[ a '](y), and yr a /\ <g) 1 ® • • • ® 1 is certainly a summand of Al a ~ a 'l (y/ a n), so yn\ <g) ■ ■ •®y( a ') ® 
1 ® • • • 1 occurs in A^ a \y). So, taking y( a '+i) = • • • = y( ) = 1, we can assume a = a'. Then 
At a l(6) = A^(x)A^(y) contains the term xmym &> • • • ® xr a \y( a y Hence *& a (6) contains the term 
xmym • • • Xf a \yr a \, and this product is x'y' if % is a polynomial algebra. 

If H is a cocommutative, free associative algebra, the factors in xmyru • • • Xf a \yi a -\ must be 
rearranged to conclude that b —> x'y' and b — > y'x' . Coassociativity implies A^ = (A®- • -(S>A)A[ a l, 
and A (a; (»)?/(»)) = A A (y (i) ) contains (x (i) (g) l) (l (g> = x^®y^, so Al 2a l(6) contains the 
term aim (gym ®X(2)® 2/(2)®" ■ •0^(a) ( 8 ) y(a)- As is cocommutative, any permutation of the tensor- 
factors, in particular, X(i)®X(2)<8>- • ■®X( a ) ( 3y(i) ( 3- ' -®2/(a) and 2/(i)<8>2/(2)<8>- ' '®V(a) • -®a;( ), 
must also be summands of A^- 2a \b), and multiplying these tensor-factors together shows that both 
x'y' and y'x' appear in ^ 2a \b). □ 



Example 3.8 (Symmetric functions and rock-breaking). Recall from Example 3.3 the algebra of 



symmetric functions with basis {e\}, which induces the rock-breaking process. Here, e\ — > e\i if 



and only if A' refines A. Lemma 3.7 for the case k = 2 is the statement that, if A is the union of 



two partitions n and u, and \i! refines /i, v' refines u, then jj! H v' refines [i H v = A. 
3.4 The symmetrization lemma 



The algorithmic construction of left and right eigenbases for the chains created in Section 3.2 will 
go as follows: 

(i) Make an eigenvector of smallest eigenvalue for each generator c; 

(ii) For each basis element b with factorization C1C2...Q, build an eigenvector of larger eigenvalue 
out of the eigenvectors corresponding to the factors c«, produced in the previous step. 

Concentrate on the left eigenvectors for the moment. Recall that the transition matrix K a 
is defined by a~ n ^f a {b) = £\/ K a (b,b')b' so the left eigenvectors for our Markov chain are the 
usual eigenvectors of \I ,a onH. Step (ii) is simple if H is a polynomial algebra, because then H is 
commutative so is an algebra homomorphism. Consequently, the product of two eigenvectors is 
an eigenvector with the product eigenvalue. This fails for cocommutative, free associative algebras 
7~L, but can be fixed by taking symmetrized products: 

Theorem 3.9 (Symmetrization lemma). Let xi,X2, • • • ,Xk be primitive elements of any Hopf al- 
gebra %, then J2 a( z Sk x a ^x a (2) ■ ■ ■ x a (k) * s an eigenvector of ^ a with eigenvalue a k . 
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Proof. For concreteness, take a = 2. Then 



mA 



^ x a(l) x o{2) ■ ■ - x a{k) 




[Ax 



a{k)) 



in | ^ + ^(i)) . . . {x a(k) (8 1 + 1® 

m z e n ® n 

AinA 2 ={i,2,...,fc} o-es fe ieAi jeA 2 

| Ai n A 2 = {1, 2, . . . , fc}}| ^ 

^ff(l) • • • x a(k) 



aes k 



2 k £ ^(i) • • • 



<T(fc) 



aes k 



□ 



In Section 3.5 and Section 3.6, the fact that the eigenvectors constructed give a basis will follow 



from triangularity arguments based on Section 3.3 These rely heavily on the explicit structure of 
a polynomial algebra or a free associative algebra. Hence it is natural to look for alternatives that 
will generalize this eigenbasis construction plan to Hopf algebras with more complicated structures. 
For example, one may ask whether some good choice of Xj exists with which the symmetrization 
lemma will automatically generate a full eigenbasis. When % is cocommutative, an elegant answer 
stems from the following two well-known structure theorems: 

Theorem 3.10 (Cartier-Milnor-Moore). JMM65, C arO^ IfHis graded, cocommutative and con- 
nected, then H is Hopf isomorphic to U(g), the universal enveloping algebra of a Lie algebra q, 
where q is the Lie algebra of primitive elements ofH. 

Theorem 3.11 (Poincare-Birkoff-Witt). \Hum7S\ \Lot91§ . If xi, x%, ... is a basis for a Lie algebra 
q, then the symmetrized products ^2 a&Sk Xi am Xi a(2) ...Xi a(h) , for 1 < i\ < 12 < • • • < ik, form a basis 
forU(g). ' " 

Putting these together reduces the diagonalization of ^ a on a cocommutative Hopf algebra to 
determining a basis of primitive elements: 

Theorem 3.12 (Strong symmetrization lemma). Let % be a graded, cocommutative, connected 
Hopf algebra, and let xi, X2, ... be a basis for the subspace of primitive elements in %. Then, for 
each fcsN, 



E 



x i, m x U2)- x U k) \ l <»i <*2 < ••• <«* 



is a basis of the a k -eigenspace of^ a . 

Much work [FislO, AS05, AS06 has been done on computing a basis for the subspace of the 
primitives of particular Hopf algebras, their formulas are in general more efficient than our universal 
method here, and using these will be the subject of future work. Alternatively, the theory of good 
Lyndon words [LR95] gives a Grobner basis argument to further reduce the problem to finding 
elements which generate the Lie algebra of primitives, and understanding the relations between 



them. This is the motivation behind our construction of the eigenvectors in Theorem 3.15, although 



the proof is independent of this theorem, more analogous to that of Theorem 3.14 the case of a 
polynomial algebra. 
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3.5 Left eigenfunctions 

This section gives an algorithmic construction of an eigenbasis for the Hopf power maps \£ a on the 
Hopf algebras of interest. If K a as defined by a~ n ^ a (b) = Y^v Ka(P, b')b' is a transition matrix, then 
this eigenbasis is precisely a left eigenbasis of the associated chain, though the results below stand 
whether or not such a chain may be defined (e.g., the construction works when some coefficients 
of A(c) are negative, and when there are primitive generators of degree > 1). The first step is to 
associate each generator to an eigenvector of smallest eigenvalue, this is achieved using the (first) 
Eulerian idempotent map 

(_l)«-i 



e(x) = 

a>l 

N n-1 



Here A(x) = A(x) — 1 ®x — x <8> 1 G ©?=i ® Hn-j > as explained in Section 2.2 Then inductively 



define A^ = (l ® • • • ® t ® A)A[ a_1 ], which picks out the terms in A^(x) where each tensor-factor 
has strictly positive degree. This captures the notion of breaking into a non-trivial pieces. Observe 
that, if x G H n , then A^x) = whenever a > n, so e(x) is a finite sum for all x. (By convention, 
e = on Hq.) 

This map e is the first of a series of Eulerian idempotents ej defined by Patras [Pat 93]; he proves 
that, in a commutative or cocommutative Hopf algebra of characteristic zero where A is locally 
nilpotent (i.e. for each x, there is some a with A\ a 'x = 0), the Hopf-powers are diagonalizable, and 
these e, are orthogonal projections onto the eigenspaces. In particular, this weight decomposition 
holds for graded commutative or cocommutative Hopf algebras. We will not need the full series 
of Eulerian idempotents, although Example 3.17| makes the connection between them and our 



eigenbasis. 

To deduce that the eigenvectors we construct are triangular with respect to B needs one crucial 



observation (recall from Section 3.3 that b — > b' if b' occurs in \I/ a (6) for some a): 



Proposition 3.13. For any generator c, 



e(c) = c + a cb >b' = c + a cb 'b', 



c^b' l{b')>l 
b'+c 



for some real a c y . 



Proof. The summand — ^ — mt a l A^ (c) involves terms of length at least a, from which the second 
expression of e(c) is immediate. Each term b' of e(c) appears in ^> a {c) for some a, hence c — > b' . 
Combine this with the knowledge from the second expression that c occurs with coefficient 1 to 
deduce the first expression. □ 

The two theorems below detail the construction of an eigenbasis for ^ a in a polynomial algebra 
and in a cocommutative free associative algebra respectively. These are left eigenvectors for the 
corresponding transition matrices. A worked example will follow immediately; it may help to read 
these together. 

Theorem 3.14. Let % be a Hopf algebra (over a field of characteristic zero) that is a polynomial 
algebra as an algebra, with monomial basis B. For b G B with factorization into generators b = 
C1C2...Q, set 

g b := e (ci) e (c 2 ) ...e (q) . 
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Then is an eigenvector of ^ a of eigenvalue a 1 satisfying the triangularity condition 

g b = b+Y,9b(b')b' = h+ Y, 9b(b')b'. 

b^b' l(b')>l(b) 
b'+b 

Hence {gb\b G B n } is an eigenbasis for the action of^ a on% n , and the multiplicity of the eigenvalue 
a 1 in % n is the coefficient of x n y l in Y\ i (l — yx" 1 ) \ where d{ is the number of generators of degree 
i. 

Theorem 3.15. LetH be a cocommutative H op f algebra (over afield of characteristic zero) that is a 
free associative algebra with word basis B. For b € B with factorization into generators b = c\C2-..ci, 
set gb to be the polynomial sym(6) evaluated at (e (ci) , e {pi) , . . . , e (q)). In other words, in the 
terminology of Section 2.3, 

• for c a generator, set g c := e(c); 

• for b a Lyndon word, inductively define gb '■= [gbi,9b 2 ] where b = 6162 is the standard factor- 
ization of b; 

• for b with Lyndon factorization b = b 1 ...b k , set g b : = J2aeS k 9b a[1) 9b a(2) ■ ■ ■ 9b a(k) ■ 

Then gb is an eigenvector of \I ,a of eigenvalue a k (k the number of Lyndon factors in b) satisfying 
the triangularity condition 

9b =Y 9b(b')b' = sym(6) + ^ gb{b')b'. 

b^b' l(b')>l(b) 

Hence {gb\b G B n } is an eigenbasis for the action of^ a on% n , and the multiplicity of the eigenvalue 
a k in Tin is the coefficient of x n y k in (l — yx*) dl , where di is the number of Lyndon words of 
degree i in the alphabet of generators. 

Remarks. 1. If ^ a defines a Markov chain, then the triangularity of gb (in both theorems) has 
the following interpretation: the left eigenfunction gb takes non-zero values only on states 
that are reachable from b. 

2. The expression of the multiplicity of the eigenvalues (in both theorems) holds for Hopf algebras 
that are bigraded, trigraded, etc, if we replace all xs, ns and is by pairs, triples, etc, and read 
the formula as multi-index notation. For example, for a bigraded polynomial algebra H, the 

multiplicity of the a -eigenspace in H m ,n is the coefficient of x™xV^y in TJ f . 1 1 — yx\x 3 2 



where dij is the number of generators of bidegree This idea will be useful in Section 5 



3. Theorem 3.15 essentially states that any cocommutative free associative algebra is in fact 
isomorphic to the free associative algebra, generated by e(c). But there is no analogous 
interpretation for Theorem 3.14; being a polynomial algebra is not a strong enough condition 
to force all Hopf algebras with this condition to be isomorphic. A polynomial algebra H is 
isomorphic to the usual polynomial Hopf algebra (i.e. with primitive generators) only if H is 
cocommutative; then e(c) gives a set of primitive generators. 

Example 3.16. As promised, here is a worked example of this calculation, in the noncommutative 
Hopf algebra of labeled graphs, as defined in Example |3.2[ Let b be the graph 
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12 3 4 = 



(• X 00 ) 



which is the product of three generators as shown. (Its factors happen to be its connected compo- 
nents, but that's not always true). Since the ordering of generators refines the ordering by degree, 
a vertex (degree 1) comes before an edge (degree 2), so the Lyndon factorization of b is 

^ 1 2 3^1 

So (ft is defined to be 

g b = g ii : g : + g. g. 



1 2 3 



The first Lyndon factor of b has standard factorization 

= (')(^) 



1 2 3 



SO 



g — 3 = [e(;),e(— 2 )] 



I 2 3 _ L V i / ? V | 2 

The Eulerian idempotent map fixes the single vertex, and 

e(; ;)= ; ; - \ mA 



2 12 12 

thus substituting into the previous equation gives 

(l 2 1 2 ) ( 1 2 12)1 



I 2 



" 1 2 3 1 2 3 

Since 

g.= e(.) 



1 1 

2 



returning to the first expression for gb gives the following eigenvector of eigenvalue a 
g b = (• ■ • - • ■ •)• + •(• ■ * " * ■ * ) 

u \12 3 1 23/1 1 \ 1 2 3 1 23/ 



1 2 3 4 1 2 3 4 
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Proof of Theorem 3.14 (polynomial algebra). By Patras [Pat93], the Eulerian idempotent map is a 
projection onto the a-eigenspace of so, for each generator c, e(c) is an eigenvector of eigenvalue 
a. As % is commutative, ^> a is an algebra homomorphism, so the product of two eigenvectors is 
another eigenvector with the product eigenvalue. Hence := e(ci)e(c2)...e(q) is an eigenvector of 
eigenvalue a 1 . 



To see triangularity, note that, by Proposition 3.13 



\ 



V 



\ 

b+ 5c 1 (c / i)...5 Ci (cDc / i--.q. 



c'-j^Ci for some i 



Lemma 3.7 shows that b — )■ . . . d, in each summand, and the condition c! i ^ Ci for some i means 
precisely that c[ . . .c' t ^ b. Also, by Proposition 3.13 



9b 



!(d)>i 



b + 



E 



Q+ Yl 9cM)4 

l{c[)>l 

g^i) ■ ■ ■ gcM) ! ■ ■ ■ 4 



l(c' i )>l for some i 

and thus Z(cj . . . cj) > ^ as length is multiplicative. 

The multiplicity of the eigenvalue a' is the number of basis elements b with length I. The last 
assertion of the theorem is then immediate from [Wil9CH Th. 3.14.1]. □ 

Example 3.17. We show that = e^(b), where the higher Eulerian idempotents are defined by 



By Patras [Pat93], is a projection to the a J -eigenspace of ^f a , so, given the triangularity condition 
of the eigenbasis {gb}, it suffices to show that b is the only term of length 1(b) in e^b)(b). Note 
that e;(b)(6) is a sum of terms of the form e(b^)e(b^ 2 ))--- e (b(i)) for some b^ with bm &> ■ • • ® 6(j) a 
summand of A^(6). As e = on %o, the 6jS must be non-trivial. Hence each term b' of em,)(6) 
has the form 6' 



6fo . . 



. b'^y with ->■ and b ->■ . . . o (i ). 
so 6 — )• 6' by transitivity, which, by Lemma 



.6m- It follows from Lemma 



3.6 



3.7 



that 



means 1(1/) > 1(b) unless 



• • • hi) 
b' = b. 

It remains to show that the coefficient of b in e^b)(b) is 1. Let b = c\ . . . c\ be the factorization 
of b into generators. With notation from the previous paragraph, taking b' = b results in b — > 
. . . fy) — > bLx . . . b'/n = b, so b = b^ . . . byy This forces the b 

bu\ occurs with coefficient 1 in e(b^), the coefficient of b^ 
coefficient of c 



'(0 



c CT (i) for some a £ Si. As 
m (e®---®e)AW(6) is the 



occurence of c. 



Ml) 



c a{l) m AW(6) = AW(ci) . . . AW(q), which is 1 for each <t G Each 
in (e<8>- ■ -®e) A M (6) gives rise to a 6 term in (e<8)e(g)- • -(g)e) A^ (b) with 



the same coefficient, for each a £ Si, hence b has coefficient l\ in mfi\e®e®- ■ ■ e)A^(6) = Z!e;(6). 



The same argument also shows that, if i < /(&), then ej(6) 
in ei(b). In particular, e(6) = if b is not a generator. 



0, as there is no term of length i 
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Proof of Theorem 3.15 (cocommutative and free associative algebra). Schmitt Sell!) 1, Thm9.4] shows 
that the Eulerian idempotent map e projects a graded cocommutative algebra onto its subspace of 
primitive elements, so g c := e(c) is primitive. A straightforward calculation shows that, if x,y G % 
are primitive, then so is [x,y]. Iterating this implies that, if b is a Lyndon word, then (which is 
the standard bracketing of e(c)s) is primitive. Now apply the symmetrization lemma (Lemma |3.9| ) 
to deduce that, if b G B has k Lyndon factors, gb is an eigenvector of eigenvalue a k . 

To see triangularity, first recall that sym is a linear combination of the permutations of its 
arguments, hence gb is a linear combination of products of the form e(c ff nj) . . . e(c a m) for some 

c', and by 



a £ Si. Hence, by Proposition 
we have b 



Lemma 



3.7 



3.13 



each term in Qh has the form d 



Mi) 



. . c 



a(iy 



Also, by Proposition 



3.13 



Vr(l) 



C a(l) With C i 



Hi, - sym | \c\ + fc(ci)c'i J Q+ ^ 5 C; (c'/)q 

i(c' 1 )>l / \ l(c|)>l 

= sym(6) + X sym (g Cl (c'^c'^ ...,g Cl (4)4) , 

l(c' i )>l for some j 

and all terms of the sum has length greater than I, as length is multiplicative, and sym is a linear 
combination of the permutations of its arguments. 

The multiplicity of the eigenvalue a k is the number of basis elements with k Lyndon factors. 
The last assertion of the theorem is then immediate from [Wil90, Th. 3.14.1]. □ 

3.6 Right eigenvectors 

To obtain the right eigenvectors for our Markov chains, consider the graded dual T~L* of the algebras 
examined above. The multiplication A* and comultiplication m* on T~L* are given by: 

(x*y*) (z) =(A* (x* ® y*)) (z) =(x* ® y*) (Az) 

(m*x*) (z (g) w) = x*m(z w) =x* (zw) 



for any x*,y* G H*, z,w G U. Then := A*^m*^ is the dual map to So, if K a , 

defined by a~ n ^ a (b) = ^2 b , K a (b,b')b' , is a transition matrix, then its right eigenvectors are the 
eigenvectors of <I'* a . The theorems below express these eigenvectors in terms of {b*}, the dual 
basis to B. Dualizing a commutative Hopf algebra creates a cocommutative Hopf algebra, and vice 



versa, so Theorem 3.18 below, which diagonalizes on a polynomial algebra, will share features 



with Theorem 3.15, which diagonalizes ^ a on a cocommutative free associative algebra. Similarly, 
Theorem 3.19 and Theorem 3.14 will involve common ideas. However, Theorems 3.18 and 3.19 



are not direct applications of Theorems 3.15 and 3.14 to 7i* as H* is not a polynomial or free 
associative algebra - a breaking and recombining chain with a deterministic recombination does 
not dualize to one with a deterministic recombination. For example, the recombination step is 
deterministic for inverse shuffling (place the left pile on top of the right pile), but not for forward 
riffle shuffling (shuffle the two piles together). 



The two theorems below give the eigenvectors of <3/* a ; exemplar computations are in Section 4.2 



Theorem 3.18 gives a complete description of these for T~L a polynomial algebra, and Theorem 3.19 
yields a partial description for % a cocommutative free associative algebra. To correctly normalize 
the eigenvectors requires one more definition: let a c (b) be the number of factors of b that are equal 
to c (for b G B, c a generator). For example, a c (cc'c) = 2. 
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Theorem 3.18. Let % be a Hopf algebra (over a field of characteristic zero) that is a polynomial 
algebra as an algebra, with monomial basis B. For b £ B with factorization into generators b = 
c\C2---ci, set 

\ c / aeSi 

Then fj, is an eigenvector of of eigenvalue a 1 , and /&(&') is the sum (over a £ Si) of the 
coefficients of c CT m <g) Ca{2) ® ' " " ® c a(i) A^(6') ; multiplied by (n c a c(^)0 _1 jr- fb s^sfies the 
triangularity condition 

fb = b* + Y J h(b')b'* = b+ Yl h{b')b'\ 

b'^b l(b')<l(b) 
b'+b 

Furthermore, {/&} is the dual basis to {gb}- In other words, fb {gy) = if b ^ b' , and fb(gb) = 1- 
Yet another formulation: the matrix with fb as its columns is the inverse of the matrix with gb as 
its rows. 

Remarks. 1. If % is also cocommutative, then it is unnecessary to symmetrize - just define 

fb = (UMbyr 1 c* 1 c* 2 ...ci 

2. If *$> a defines a Markov chain on B n , then the theorem says fb(b') may be interpreted as 
the number of ways to break b' into / pieces so that the result is some permutation of the I 
generators that are factors of b. In particular, fb takes only non-negative values, and /& is 
non-zero only on states which can reach b. Thus fb may be used to estimate the probability 
of being in states that can reach b, see Corollary 4.10| for an example. 



Theorem 3.19. Let H be a cocommutative Hopf algebra (over a field of characteristic zero) which 
is a free associative algebra as an algebra, with word basis B. For each Lyndon word b, let fb be the 
eigenvector of fy* a of eigenvalue a such that fb(gb) = 1 an d A (<?&') = f or a ^ other Lyndon b' . In 
particular, f c = c* and is primitive. For each basis element b with Lyndon factorization b = b\...bk, 
let ^ 

/&:= {\[a c {b)\\ f bl ...f bk 

Then fb is an eigenvector of^* a of eigenvalue a k , and {fb} is the dual basis to {gb}- Lfb = c\oi-.-ci 
with c\ > C2 > ... > ci in the ordering of generators, then fb(b') is the coefficient of ci <S> C2 <S> ... ® q 
in A^(b'), multiplied by dL^^n 1 ■ 



Proof of Theorem 3.18 (polynomial algebra). Suppose b* b* is a term in m*(c*), where c is a 
generator. This means m*(c*)(&® b') is non-zero. Since comultiplication in T~L* is dual to multipli- 
cation in H, m*{c*){b® b') = c*(bb'), which is only nonzero if bb' is a (real number) multiple of c. 
Since c is a generator, this can only happen if one of b, b' is c. Hence c* is primitive. Apply the 



symmetrization lemma (Lemma 3.9) to the primitives c*, . . . ,c* to deduce that fb as defined above 
is an eigenvector of eigenvalue a . 



Since multiplication in %* is dual to the coproduct in %, c t(i) c a(2) • • • c o-(q(^') = c o-(i) w °o-(2) 



c 



• • • <8> c * a m (&'))> from which the formula for fb(b') is immediate. 

It is apparent from the formula that b'* appears in fb only if c a m ■ ■ • c a n\ = b appears in ty l (b'), 
hence b — > b' is necessary. /&(&) is the sum over Si of the coefficients of c a m ® • • • ® c a m in 
A^(6) = A^(ci) . . . A^(q). Each term in AW(cj) contributes at least one generator to at least one 
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tensor- factor, and each tensor-factor of c^m ® • • • ® c^m is a single generator, so each occurrence 
of c CT n) ® ■ ■ ■ ® c CT (n is a product of terms from A^(ci) where one tensor-factor is Cj and all other 
tensor- factors are 1. Such products are all V. permutations of the Cj in the tensor- factors, so, for 
each fixed a, the coefficient of c a m ® • • • ® c^n in A^ (6) is J} c a c (6)!. This proves the first equality 
in the triangularity statement. Triangularity of fb with respect to length follows, as ordering by 



length refines the relation — > (Proposition 3.6). 



To see duality, first note that, since \I/* a is the linear algebra dual to \I/ a , /& (^f a g b ') = (^* a fb) (gb')- 
Now, using that fb and gb are eigenvectors, it follows that 

a m h (gv) = f b (^gv) = ** a f b (gv) = a l ®f b (g v ) , 

so f b (g b >) = if l(b')^ 1(b). 

Now suppose l(b') = 1(b) = I. Then 



fb(gy) =\b*+ Y, fb(M% )\ b + E 9b'(b 2 )b2 = b*(b>), 

\ l(bi)<l ) \ l(b 2 )>l ) 

which is when b ^ b' , and 1 when b = b' . □ 



Proof of Theorem 3.19 (cocommutative and free associative algebra). H* is commutative, so the power 
map is an algebra homomorphism. Then, since fb is defined as the product of k eigenvectors each 
of eigenvalue a, fb is an eigenvector of eigenvalue a k . 



For any generator c, c* is primitive by the same reasoning as in Theorem 3.18, the case of 
a polynomial algebra. To check that c*(g c ) = 1 and c*(gb) = for all other Lyndon b, use the 
triangularity of g b . 

c*(g b ) = c*(sym(b)) + Y 9b(b')c* (b'). 

l(b')>l(b) 

Each summand c*(b') in the second term is as 1(c) = 1 < 1(b) < l(b'). As sym(6) consists of terms 
of length 1(b), c*(sym(6)) is unless 1(b) = 1, in which case sym(6) = b. Hence c*(gb) = c*(sym(6)) 
is non-zero only if c = 6, and c*(g c ) = c*(sym(c)) = c*(c) = 1. 

Turn now to duality. An analogous argument to the polynomial algebra case shows that 
f b (g b ') 7^ only when they have the same eigenvalue, which happens precisely when b and b' 
have the same number of Lyndon factors. So let b\...bk be the decreasing Lyndon factorization of 
b, bi...b' k be the decreasing Lyndon factorzation of b' . To evaluate 



fb(gv) = rQ ac ( 6 ) ! fbi-fbk E 9b '<r(i)~ 9b : 
\ c / \ CTe s fc 



tr(fc) 

, es fc 
observe that 

f bl ...f h {gy a{1) ...gy a{ J = (A, ® - ® /&J A™ J 

= (/ 6i ®...®/ 6 j(aw(, 6 , (i) )...aw( %( J). 

As gy is primitive, each term in A^ ( ) has g b > in one tensor-factor and 1 in the k — 1 

o-(i) \ tr(i)/ cr(i) 

others. Hence the only terms of A^l (gb' ) ...A^ ( gy ) without Is in any tensor factors are 
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those of the form gy (g) ... ® for some r G Sfc. Now = for all i, so (8) ... <8> /fe fe 

tct(1) tct(A:) 

annihilates any term with 1 in some tensor-factor. Hence 



\a&S k J \a,T£S k 



As /& is dual to for Lyndon 6, the only summands which contribute are when &, = &^ T ^\ for all i. 
In other words, this is zero unless the bi are some permutation of the b^. But both sets are ordered 
decreasingly, so this can only happen if bi = b\ for all i, hence b = b' . In that case, for each fixed 
a G Sk, the number of r G Sfc with b{ = b aT ^ for all i is precisely n c a cW> so fb(db) = 1- 

The final statement is proved in the same way as in Theorem |3,18[ for a polynomial alge- 
bra, since, when b = C1C2...Q with c\ > C2 > ... > Q in the ordering of generators, /& = 

(n^cW!)- 1 ^^...^. 

□ 

3.7 Stationary distributions, generalized chromatic polynomials, and absorp- 
tion times 



This section returns to probabilistic considerations, showing how the left eigenvectors of Section 3.5 
determine the stationary distribution of the associated Markov chain. In the absorbing case, "gen- 
eralized chromatic polynomials" , based on the universality theorem in [ABS06] , determine rates of 
absorption. Again, these general theorems are illustrated in the three sections that follow. 

3.7.1 Stationary distributions 

The first proposition identifies all the absorbing states when T~L is a polynomial algebra: 

Proposition 3.20. Suppose H is a polynomial algebra where K a , defined by a~ n ^ a (b) = Y^y K a (b, b') 
is a Markov kernel. Then the absorbing states are the basis elements b G B n which are products of 
n (possibly repeated) degree one elements, and these give a basis of the 1-eigenspace of K a . 



Example 3.21. In the commutative Hopf algebra of graphs in Examples 2.1 and |3.1[ there is a 
unique basis element of degree 1 - the graph with a single vertex. Hence the product of n such, 
which is the empty graph, is the unique absorbing state. Similarly for the rock-breaking example 
(symmetric functions) on partitions of n, the only basis element of degree 1 is e\ and the stationary 
distribution is absorbing at 1™ (or e\n). 

The parallel result for a cocommutative free associative algebra picks out the stationary distri- 
butions: 

Proposition 3.22. Suppose % is a cocommutative and free associative algebra where K a , defined by 
a~ n ^> a (b) = J2b' Ka(b, b')b' , is a Markov kernel. Then, for each unordered n-tuple {ci, C2, . . . , c n } of 
degree 1 elements (some may be identical), the uniform distribution on (c -( 1 )C -( 2 ) . . . cv^ler G S* n } 
is a stationary distribution for the associated chain. In particular, all absorbing states have the form 
c™, where c, has degree 1. 
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Example 3.23. In the free associative algebra M.(x±,X2, ■ ■ ■ ,x n ), each Xi is a degree 1 element. 
So the uniform distribution on x a n\ . . . £ CT ( n ) (a £ S n ) is a stationary distribution, as evident from 
considering inverse shuffles. 



Proof of Proposition 3.20, From Theorem 3.14 a basis for the 1-eigenspace is {gb\l(b) = n}. This 
forces each factor of b to have degree 1, so b = c\c<i . . . c n and gb = e (ci) . . . e (c n ). Now e(c) = 

Ea>i ^ir— m [a] A[ a] (c), and, when deg(c) = 1, m^A^(c) = for all a > 2. So e(c) = c, and 
hence <#> = c\C2 ■ ■ ■ c n = b, which is a point mass on 6, so b = c\C2 ■ ■ ■ c n is an absorbing state. □ 



Proof of Proposition \3 '.22 , From Theorem 3.15 a basis for the 1-eigenspace is {gb\b G i3 n ,6 has 



n Lyndon factors}. This forces each Lyndon factor of b to have degree 1, so each of these must 
in fact be a single letter of degree 1. Thus b = c\C2.-.c n and = YlaeS n 9c<r(i) ■ ■ ■ 9^(n) = 
So-eSn c f(i) ' " ' c °-(")> as 5'( c ) = c for a generator c. An absorbing state is a stationary distribution 
which is a point mass. This requires c a i\\ . . . c a r n -\ to be independent of a. As % is a free associative 
algebra, this only holds when ci = • • • = c n =: c», in which case gb = n!c™, so c™ is an absorbing 
state. □ 

3.7.2 Absorption and chromatic polynomials 



Consider the case where there is a single basis element c, of degree 1. Then, by Proposition 3.20 



and Proposition 3.22 the K a chain has a unique absorbing basis vector c™ E 7-L n . The chance 
of absorption after k steps can be calculated using an analog of the chromatic polynomial. Note 
first that the property K a * K a i = K aa > implies it is enough to calculate K a (b,c™) for general a 
and starting state b G T~L n . To do this, make % into a combinatorial Hopf algebra in the sense of 
[ABS06J by defining a character £ that takes value 1 on c, and value on all other generators, 
and extend multiplicatively and linearly. In other words, Q is an indicator function of absorption, 
taking value 1 on all absorbing states and on all other states. By |ABS061 Th. 4.1] there is a 
unique character-preserving Hopf algebra map from T~L to the algebra of quasisymmetric functions. 
Define Xb to be the quasisymmetric function that is the image of the basis element b under this 
map. (If 7-^ is cocommutative, Xb will be a symmetric function.) Call this the generalized chromatic 
polynomial of b since it is the Stanley chromatic polynomial for the Hopf algebra of graphs [Sta95j. 

Proposition 3.24. With notation as above, the probability of being absorbed in one step of K a 
starting from b (that is, K a (b,c™)) equals 

X6 (^j 0, 0, ... ) (first a arguments are non-zero). 



Proof. By definition of K a , the desired probability K a (b,c™) is a~ n times the coefficient of 
in ty a (b). Every occurrence of c™ in ^ a (b) = m^A^(b) must be due to a term of the form 
c" 1 <8> c^ 2 (8> • • • ® c^ a in A^(b), for some composition a = («i, . . . ,a n ) of n (some a« may be 0). 
So, in the notation of the proof of Theorem |3.4[ 

K a (b,c?) = a-"J2%' 1 ' C * 2 '-* C - a > 

a 

where the sum runs over all a with a parts. The coefficient of the monomial M a in Xb is defined 
to be the image of b under the composite 

n ^ n®" "I*"**-, n ai ® • • • ® n aa ^ r 
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where, in the middle map, ir ai denotes the projection to the subspace of degree ct{. As C takes value 
1 on powers of c, and on other basis elements, it transpires that 



Xb 



EC. ,c.' 



summing over all compositions of n regardless of their number of parts. Hence 
K a (b, c?) = a- n Xb (l, 1, ■ ■ ■ , 1, 0, . . . ) = X b h ■ ■ ■ , \, 0, 0, . . . ) 
with the first a arguments non-zero. 



□ 



Using a different character £, this same argument gives the probability of reaching certain sets 
of states in one step of the K a (b, — ) chain. This does not require H to have a single basis element 
of degree 1. 

Proposition 3.25. Let C be a subset of generators, and £ c be the character taking value 1 on C 
and value on all other generators (extended linearly and multiplicatively) . Let X\> be the image 
of b under the unique character-preserving Hopf map from H to the algebra of quasisymmetric 
functions. Then the probability of being at a state which is a product of elements in C, after one 
step of the K a chain, starting from b, is 

£,0,0,...) 



Xb ( a i a ' 



(first a arguments are non-zero). 



Example 3.26 (Rock-breaking). Recall the rock-breaking chain of Example |3.3| Let C = {ei,e2}. 
Then x% (2 _fe , 2~ k , . . . , 2 _fc , 0, 0, ... ) measures the probability that a rock of size n becomes rocks 
of size 1 or 2, after k binomial breaks. 



4 Symmetric functions and breaking rocks 



This section studies the Markov chain induced by the Hopf algebra of symmetric functions. Sec- 



tion 4.1 presents it as a rock-breaking process with background and references from the applied 



probability literature. Section 4.2 gives formulae for the right eigenfunctions by specializing from 



Theorem 3.18 and uses these to bound absorption time and related probabilistic observables. Sec- 



tion 4.3 gives formulae for the left eigenfunctions by specializing from Theorem 3.14 and uses these 
to derive quasi-stationary distributions. 

4.1 Rock-breaking 



As in Examples 1.2 and 3.3 the Markov chain corresponding to the Hopf algebra of symmetric 
functions may be described as a rock-breaking process on partitions of n: at each step, break each 
part independently with a symmetric binomial distribution. The chain is absorbed when each part 
is of size one. Let P n (X, /i) be the transition matrix or chance of moving from A to /U in one step 
for A, // partitions of n. For n = 2, 3, 4, these matrices are: 
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= 2 




l 2 
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l 2 


1 
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= 3 
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= 4 
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The oth power map \& a yields rock-breaking into a pieces each time, according to a symmetric 
multinomial distribution. To simplify things, we focus on a = 2. 

The mathematical study of rock-breaking was developed by Kolmogoroff [Kol4l] who proved a 
log normal limit for the distribution of pieces of size at most x. A literature review and classical 
applied probability treatment in the language of branching processes is in [A N 72 ] . They allow more 
general distributions for the size of pieces in each break. A modern manifestation with links to many 
areas of probability is the study of fragmentation processes. Extensive mathematical development 
and good pointers to a large physics and engineering literature are in |Ber03l IBer06j . Most of the 
probabilistic development is in continuous time and has pieces breaking one at a time. We have 
not seen previous study of the natural model of simultaneous breaking developed here. 

The rock-breaking Markov chain has the following alternative "balls in boxes" description: 

Proposition 4.1. The distribution on partitions of n induced by taking k steps of the chain P n 
starting at the one-part partition (n) is the same as the measure induced by dropping n balls into 
2 k boxes (uniform multinomial allocation) and considering the partition induced by the box counts 
in the non-empty cells. For X a partition of n, written as l ai ( A ) ... re a «( A ) with ia^(A) = n, 

set P*((n), A) = 2nfc |-p 2 ( a .;)a t Y\ n \i\) a i • (^ ere ' a o(A) is the number of empty cells.) Then, for any 
partition ji 

Pnifi, A) = P* * P^ 2 * • • • * (usual convolutions of measures on M. n ). 



Remarks. Because of Proposition 4.1, many natural functions of the chain can be understood, as 
functions of k and re, from known properties of multinomial allocation. This includes the distribu- 
tion of the size of the largest and smallest piece, the joint distribution of the number of pieces of 
size i, and total number of pieces. See |KSC78| IBHJ92]. 

Observe from the matrices for n = 2,3,4 that, if the partitions are written in reverse- lexicographic 
order, the transition matrices are lower-triangular. This is because lexicographic order refines the 
ordering of partitions by refinement. Furthermore, the diagonal entries are the eigenvalues 1/2 1 , as 



predicted in Proposition 3.6 Specializing the counting formula in Theorem 3.14 to this case proves 
the following proposition. 

Proposition 4.2. The rock-breaking chain P n (A, (ji) on partitions of size re has eigenvalues 1/2* , < 
i < n— 1, with 1/2* having multiplicity p(n,i), the number of partitions of n into i parts. 

4.2 Right eigenfunctions 

Because the operator P n (A,/u) is not self-adjoint in any reasonable sense, the left and right eigen- 



functions must be developed distinctly. The following description, a specialization of Theorem 3.18 
may be supplemented by the examples and corollaries that follow. A proof is at the end of this 
subsection. 

Proposition 4.3. Let //, A be partitions of n. Let Oj(A) denote the number of parts of size i in A 
and l{\) the total number of parts. Then the fj,th right eigenfunction for P n , evaluated at A, is 

w=n^En Xjl 



Hi Mi! ^ oi (^)!o2 <y' )! . . . a Aj OP')! 



:> 



where the sum is over all sets {/J, J } such that is a partition of Xj and the disjoint union lljfjp = 
The corresponding eigenvalue is 2 l ^~ n . /^(A) is always non-negative, and is non-zero if and only 
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if [A is a refinement of X. If X is any set partition with underlying partition X, then /u(A) is the 
number of refinements of X with underlying partition \i. 

Here is an illustration of how to compute with this formula: 

Example 4.4. For n = 5, yu = (2, 1, 1, 1), A = (3, 2), the possible {^} are 

A* 1 = (2, 1), /i 2 = (l,l) 

= (1,1,1), A* 2 = (2). 

Then 

f (\)- 1 ( 3! 2! i 3!2! 

M ; 2!1!1!1! V 1!1! 2! 3! 1! 



Example 4.5. For n = 2, 3, 4, the right eigenfunctions are the columns of the matrices: 



1 1 
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For some A, /i pairs, the formula for /^(A) simplifies. 
Example 4.6. When A = (n), 

1 r?' 

/ (( n )) = 

Example 4.7. fin = 1 has eigenvalue 1. 

Example 4.8. As (n) is not a refinement of any other partition, _/7 n ) is non-zero only at (n). Hence 

/(n)(A) = £(n)(A). 

Example 4.9. When fj, = l n ~ r r (r / 1), 

f>w = E 0') 

3 

with eigenvalue l/2 r_1 . Thus /in-22(A) = Yl C2) w i^h eigenvalue 1/2 is the unique second-largest 
eigenfunction. 

Example |4.9| can be applied to give bounds on the chance of absorption. The following corollary 
shows that absorption is likely after k = 2 log 2 n + c steps. 



32 



Corollary 4.10. For the rock-breaking chain Xq = (n), X\, X 2 , ■ ■ ■ , 



P{X k + 1") < ™ 



Proof. By Example 4.9, if fi = l n 2 2, /^(A) = ^ (2') is an eigenfunction with eigenvalue 1/2. 
Further, /„ is zero if and only if A = 1™. As an eigenfunction 



P{X k + 1"} = P{f(X k ) > 1} < E(f,(X k )) - 2 , . 



□ 



Remarks. 



1. From Proposition 4.1 and the classical birthday problem 

i 

2^ 



P{X k + 1"} = nfl - ~ k \ = e E -K<-*) = e~ E - = e -t + °l^ 



i=i v 

It follows that, for k = 2 log 2 n + c (or 2 fc = 2 c n 2 ) the inequality in the corollary is essentially 
an equality. 

2. Essentially the same calculations go through for any starting state A using 

/u(A) 



Ex (U(X k )) 



2 k 



3. Other eigenfunctions can be similarly used. For example, when fi = 1™ r r, /^(A) = ^ • (^f) > 
if and only if maxj Xj > r. It follows as above that 

(n\ 

P (n) {maxX M >r} = P {n) {f^X k ) > 1} < 



2(r-l)fc ' 



4. The right eigenfunctions with \x = l n r r can be derived by a direct probabilistic argument. 
Drop n balls into 2 k boxes. Let Ni be the number of balls in box i. Then 



The last equality follows because iVi is binomial (n, l/2 k ) and, if X is binomial (n,p), E(X(X— 
1) • • • (X — r + 1)) = n(n — 1) • • • (n — r + l)p r - The other eigenvectors can be derived using 
more complicated multinomial moments. 



Proof of Proposition For concreteness, take Z(A) = 2 and l(fj,) = 3. Then Theorem 3. 18| states 
that ffj,(X) is the coefficient of pL\ (g) [i 2 (g) in Al 3 l(A) (viewing as a partition of single part), 
divided by Yliai(n)\. Recall that 

A^(X) = A^(X 1 )A^(X 2 ) = V (. X \)(. X . 2 .)iiU l2 ® J1 Uj 2 ®k 1 Uk 2 . 

-T^._x ViJihJ \i 2 j 2 k 2 ) 



h+ji+ki=Xi 



33 



So calculating / M (A) requires summing the coefficients of the terms where %\ II ii = n\, j\ II ji = 
fi2, ki]Ik2 = /U3. As fj,\ only has one part, it must be the case that either i\ = fi\ and 22 = 0, or i\ = 
and %2 = Mi, and similarly for ^2,1^3- Thus, removing the parts of size from k\, i2,j%, fe) 

and reordering gives fi. So (jJ^J G 2 j 2 2 fc 2 ) = Jj^^l - Also i if z" 1 denotes the partition obtained by 
removing 0s and reordering ki), and from {12,32, ^2), then the disjoint union /i 1 II// 2 = /i. 

Given /z 1 and /x 2 , the number of different sextuples ki,i2,j2, ^2) it could have come from is 



n 

Hence 



aj (//) a, O 2 ),/' 



V « / w M2 ^ i ^ ) ai ^ 

which simplifies as desired. It is then an easy exercise to check that this is the number of refinements 
of the set partition A of underlying partition /i. □ 

4.3 Left eigenfunctions and quasi-stationary distributions 



This subsection gives two descriptions of the left eigenfunctions: one in parallel with Proposition 4.3 
and the other using symmetric function theory. Again, examples follow the statement with proofs 
at the end. 

Proposition 4.11. For the rock-breaking Markov chain P n on partitions of n, for each partition 
A of n, there is a left eigenfunction g\(fJ>) with eigenvalue l/2 n ~ l ( x \ 

5A ( M ) = A 1 !A 2 !...A i(A) !^-^ -11 



where the sum is over sets {ft- 1 } such that ji 3 is a partition of Xj and U/x- 7 = [i. g\([i) is non-zero 
only if fj, is a refinement of A. 

As previously, here is a calculational example. 

Example 4.12. When fi = (2, 1, 1, 1), A = (3, 2), 



Example 4.13. For n = 2, 3, 4, the left eigenfunctions g\ are the rows of the matrices: 
n = 2 
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-1 1 
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Observe that these matrices are the inverses of those in Example 4.5 , as claimed in Theorem 3.18 
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The next three examples give some partitions A for which the expression for g\{[i) condenses 
greatly: 

Example 4.14. If A = (n) then gr n \ is primitive and 

(-l)M-i (Z(m)-1)! 



5(n)(^) = « 



MiW . . ai 0)!a 2 (ju)! . . .a Aj (ju)! 

with eigenvalue l/2 n . 

Example 4.15. If A = \ n ~ r r (r 7^ 1), g A has eigenvalue l/2 r and 

(/(^ -n + r - 1)! 



5a(X) = r\ 



fj,i\/j, 2 \ ■ ■ ■ 01 (/x-n + r)!a 2 (//)!... (//)! 



if ai(/i) > ai(A), and otherwise. In particular, <7in-2 2 puts equal mass at [i = 1" and /i = l n 2 2 
with mass for other ju. 

Example 4.16. Take A = l n . As no other partition refines A, gx n ((J>) = #]»»(//), and this is the 
stationary distribution. 

The left eigenfunctions can be used to determine the quasi-stationary distributions tt , 7r 2 
described in ISection 2.11 

Corollary 4.17. For i/ie rock-breaking Markov chain P n on partitions of n, 

vr^/i) = tt 2 (/x) = 6 ln -2 2 (fj,) for fi ^ l n . 



Proof. From (2.4), 7T 1 is proportional to g\r. 



theorem ensures that ir 1 is non-negative. From Examples 4.15 and 4.16 
Similarly, ir 2 is proportional to <? 2 i(/i)/2i(/-*) 



2 2 on the non-absorbing states. The Perron-Frobenious 

1 



tt = <5 1 n-2 2 (/i) for /1 / 1' 



$!»-2 2 (a0 for // / l n . 



□ 



From [Gei77j , the power sum symmetric functions p n are the primitive elements of the ring of 
symmetric functions and their products p\ give the left eigenfunctions of the Hopf-power chains. 
Up to scaling, p n is the only primitive element of degree n, so p n must be a scalar multiple of gi n y 
#( n ) is normalized so that the coefficient of e n is 1 (recall e n 



By Theorem 
determinant 



3.14 

formula |Mac95l p. 28]: 



i\e r 



whilst the 



Pn 



det 



/ex 

2e 2 

\ne n 



1 

ei 




1 



e n -i e n -2 



0\ 


exj 



shows that e n has coefficient (— l) n n'mp n . Comparing these show gr n \ = (— 1)" (n— l)!p n , so the 
left eigenfunctions g\ are (— l) n_/ ( A ) FJ^ (Aj — 1)!pa expressed in the e A basis. Hence Proposition 



4.3 



and Proposition 4.11 may be rephrased as follows, respectively: 

(I W) 



Px = (-i)n+m Xl \ 2 ...\ l{X) n 



1)! 



}-ax( f P)\a 2 ( t jJ)\...a Xi (f J J)\ " 
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e =(-i)n+m v rr AiA2 --- A ^ v , 

" 1 j ^ a;i /( .0!, 2 i//.M!...a A! S / ,M! / ' A 

fJ, 3 FAj 3 

II/*>'=/x 

For example, for A = l n , pi = e±, and pin = ei« = ejn corresponding to the left eigenvector 
(1,0, . . . , 0) with eigenvalue 1. For A = 21 n_2 , p 2 = e\ — 2e 2 = e\ — e 2 , so P]n-2 2 = e\n — 
ein-2 2 corresponding to the eigenvector (1, —1,0, ... ,0) with eigenvalue 1/2. For A = l n_3 3, P3 = 



( ei 1 



det 2e 2 ei l = e± — 3eie 2 + 3e3 = — %£\e2 + 563. Multiplying by 2 gives the left eigenvector 

\3e 3 e 2 e\ J 

(2, -3, 1, 0, . . . , 0) with eigenvalue 1/4. 

Proof of Proposition ^. 11 . g\(jjL) is the coefficient of [i in e (Ai) e (A 2 ) . . . e (A/(a)) • Every occurence 
of /j, in e (Ai) . . . e (Ajrn) is a product of a ^i 1 term in e(Ai), a /x 2 term in e(A 2 ), etc., for some choice 
of partitions ^ of Xj with H,/^ = fi. Hence it suffices to show that the coefficient of a fixed n J in 
e(Xj) is 

(-l)^)-^!^)-!)! 
a 1 (^)\---a x .(^)\f,{\---^ ) \' 

Recall that e(Xj) = J2 a >i — m^A^(Xj), and observe that all terms of m'"' A^(Xj) are parti- 
tions with a parts. Hence fi J only occurs in the summand with a = l^ 3 )- So the number needed is 

— multiplied by the coefficient of /i J in m^A^^Xj). Each occurrence of ^ in m^A^ (Xj) 

is caused by M^-m ® ' ' ' ® ^(a) m ^ or some a £ "S"a- For each fixed a, /^n) ® • • • ® M^a) 

has coefficient 

-\? \ ( ^3 



3 3 I \ 3 3 



in A' a ' (Aj), and the number of cr € S a leading to distinct a-tuples Mo-riv • • • , /-v( a ) * s aT 



(M3')!...a A ,(AtJ)!- 



Hence the coefficient of u? m m' W °J (A,) is ( , , ) — ? ,m ? ,m as desired. □ 

Note. This calculation is greatly simplified for the algebra of symmetric functions, compared to 
other polynomial algebras. The reason is that, for a generator c, it is in general false that all terms 
of 7W- a * At"! (c) have length a, or equivalently that all tensor- factors of a term of A^ (c) are generators. 
See the fourth summand of the coproduct calculation in Example 2.2 for an example. Then terms 
of length say, three, in e(c) may show up in both m^A^(c) and m^A^(c), so determining the 
coefficient of this length three term in e(c) is much harder, due to these potential cancellations in 
e(c). Hence much effort [FislO, AS051 [A"S06j has gone into developing cancellation-free expressions 
for primitives, as alternatives to e(c). 



5 The free associative algebra and riffle shuffling 



This section works through the details for the Hopf algebra fc(xi,x 2 , . . . , xjsr) and riffle shuffling 
(Examples |1.1| and 1.3). Section 5.1 gives background on shuffling, Section 5.2 develops the Hopf 
connection, |Section 5.3 gives various descriptions of right eigenfunctions. These are specialized to 



decks with distinct cards in 



Section 5.3.1 



which shows that the number of descents- 



n-l 



and the 



number of peaks— are eigenfunctions. The last section treats decks with general composition 
showing that all eigenvalues 1/2* , < i < n—1, occur as long as there are at least two types of cards. 
Inverse riffle shuffling is a special case of walks on the chambers of a hyperplane arrangement and 
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of random walks on a left regular band. |Sall2| and |Denl2] give a description of left eigenfunctions 
(hence right eigenfunctions for forward shuffling) in this generality. 



5.1 Riffle shuffles 

Gilbert-Shannon-Reeds introduced a realistic model for riffle shuffling a deck of n cards. It may be 
described in terms of a parameterized family of probability measures Q a (c) for a in the symmetric 
group S n and a £ {1,2,3,...} a parameter. A physical description of the a-shuffle begins by 
cutting a deck of n cards into a piles according to the symmetric multinomial distribution, so 
the probability of pile % receiving m cards is ( nin "... nj ,)/ an - Then, the piles are riffled together 
by (sequentially) dropping the next card from pile % with probability proportional to pile size; 
continuing until all cards have been dropped. Usual riffle shuffles are 2-shuffles and }BD92j show 
that Q a * Qb(cr) = '}2 rj Qa{'n)Qb{^'n~ l ) = Qab{&)- Thus to study Q* 2 k {°) = Q2 k ( a ) ^ is enough to 
understand Q a {&)- They also found the closed formula 

(5.1) Qa{a)={^ + a ~ { y ) + l) ^j / a n , d(a) = # descents in a. 

Using this they proved that | log 2 n + c 2-shuffles are necessary and suffice to mix n cards. 

The study of Q a (cr) has contacts with other areas of mathematics: to Solomon's descent algebra 
[Sol68l lDFP92] . quasisymmetric functions |Sta99llStaOTllFul01j . hyperpl ane arrangements [BHR99, 
BD98, AD10J, Lie theory |Reu93} lReu03] . and, as the present paper shows, Hopf algebras. A survey 
of this and other connections is in [Dia03j with [ADSllJ ICH101 IDFHllj bringing this up to date. 
A good elementary textbook treatment is in [GS97 . 

Of course, shuffling can be treated as a Markov chain on S n with transition matrix K a (a, ir) = 
Qai^^ 1 ), the chance of moving from a to n after one a-shuffle. To check later calculations, when 
n = 3, the transition matrix is 1/a 3 times 

123 132 213 231 312 321 



123 
132 
213 
231 
312 
321 



rn (T) m m cd (D 

( a t) m (t 1 ) (a) ( a t) ( a t) 

h 1 ) m m m © m • 

h 1 (s) h 1 m ( a t) ( a f) 

(T) (Y) (3) ( a t) ( a t 2 ) ( a t) 

(3) ( a t) ( a t) ( a t) m 



It is also of interest to study decks with repeated cards. For example, if suits don't matter, the 
deck may be regarded as having values 1, 2, . . . , 13 with value i repeated four times. Now, mixing 
requires fewer shuffles; see |CV06|ICV071lA"DSllj for details. The present Hopf analysis works here, 
too. 



5.2 The Hopf connection 



Let H = k{x\,X2, ■ ■ ■ ,xn) be the free associative algebra on generators, with each Xi primitive. 
As explained in Examples 1.1 and 1.3 the map x — > $> a (x)/a desx is exactly inverse a-shuffling. 
Observe that the number of cards having each value is unchanged during shuffling. This naturally 
leads to the following finer grading on the free associative algebra: for each v = (ui, 1/2, • • • , vn) & 
N N , define H v to be the subspace spanned by words where x^ appears Vi times. The ath power 
map ^ a = m'^A'"' : % — > % preserves this finer grading. The subspace H x n C k{x\,X2, ■ ■ ■ , xjv) 
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is spanned by words of degree 1 in each variable. A basis is x a = cc CT - im • • • x a -ir n y The mapping 
^ a preserves T-L\n and ^ r ^ ,a (x cr ) = Yin Qa('KO~~ 1 )x. K . With obvious modification the same result 
holds for any subspace H u . Working on the dual space %* gives the usual Gilbert-Shannon-Reeds 
riffle shuffles. Let us record this formally; say that a deck has composition v if there are v% cards of 
value i. 

Proposition 5.1. Let v = U2, ■ ■ ■ , vn) be a composition of n. For any a £ {1,2,...}, the 

mapping preserves % u and the matrix of this map in the monomial basis is the transpose of 

the transition matrix for the inverse a-shuffling Markov chain for a deck of composition v. The 



dual mapping is the Gilbert-Shannon-Reeds measure (5.1) on decks with this composition. 



Note. Since the cards behave equally independent of their labels, any particular deck of interest can 
be relabeled so that v\ > V2 > • • • > vn- In other words, it suffices to work with T~L U for partitions 



5.3 Right eigenfunctions 



Theorem 3.15 applied to the free associative algebra gives a basis of left eigenfunctions for inverse 



shuffles, which are right eigenfunctions for the forward GSR riffle shuffles. By Remark 2 after 



Theorem 3.15, each word w 6 rl v corresponds to a right eigenfunction f w for the GSR measure 



on decks with composition v. As explained in Example 1.3 and Section 2.3, these are formed 
by factoring w into Lyndon words, standard bracketing each Lyndon factor, then expanding and 
summing over the symmetrization. The eigenvalue is a k ~ n with k the number of Lyndon factors of 
w. The following examples should help understanding. 

Example 5.2. For n = 3, with v = l 3 , 7i u is 6-dimensional with basis {xo-jo-ggg. Consider 
w = X1X2X3. This is a Lyndon word so no symmetrization is needed. The standard bracketing 
\(xix 2 x 3 ) = [A(aci), A(x 2 x 3 )] = [x±, [x%, x 3 ]] = x\X2X 3 — xix 3 x 2 — x 2 x 3 xi + X3X2X1. With the 
labeling of the transition matrix K a of (5.1 ), the associated eigenvector is (1, 1, 0, —1, 0, — 1) T with 
eigenvalue I /a 2 . 

For n = 3, v = (2, 1), % u is three-dimensional with basis x^X2, x±x 2 xi, X2x\. Consider w = x 2 x\. 
This factors into Lyndon words as X2-x\-x\\ symmetrizing gives the eigenvector x\ X2+x\x 2 x\+X2x\ 
or (1, 1, 1) with eigenvalue 1. 

The description of right eigenvectors can be made more explicit. This is carried toward in the 
following two sections. 

5.3.1 Right eigenfunctions for decks with distinct values 



Recall from Section 2.3 that the values of an eigenfunction f w at w' can be calculated graphically 
from the decreasing Lyndon hedg erow T w of w. When w has all letters distinct, this calculation 
simplifies neatly. To state this, extend the definition of /; (I a Lyndon word with distinct letters) 
to words longer than I, also with distinct letters: fi(w) is /; evaluated on the subword of w whose 
letters are those of I, if such a subword exists, and otherwise. (Here, a subword always consist of 
consecutive letters of the original word.) Because w has distinct letters, there is at most one such 
subword. For example, /3s(14253) = /3s(53) = —1. 

Proposition 5.3. Let w be a word with distinct letters and Lyndon factorization I1I2 ■ ■ - h- Then, 
for all w' with distinct letters and of the same length as w, f w (w') = fi 1 (w')fi 2 (w')...fi k (w') and f w 
takes only values 1,-1 and 0. 
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Example 5.4. /35142 (14253) = / 35 (14253)/i42(14253) = -1 • 1 = -1 as calculated infection 2.3 



Proof. Recall from Section 2.3 that f w (w') is the signed number of ways to permute the branches 
and trees of T w to spell w' . When w and w' each consist of distinct letters, such a permutation, if it 
exists, is unique. This gives the second assertion of the proposition. This permutation is precisely 
given by permuting the branches of each I). so they spell subwords of w' . The total number of 
branch permutations is the sum of the number of branch permutations of each . Taking parity 
of this statement gives the first assertion of the proposition. □ 

The example above employed a further shortcut that is worth pointing out: the Lyndon factors 
of a word with distinct letters start precisely at the record minima, since a word with distinct letters 
is Lyndon if and only if its first letter is minimal. Thus 35142 has minima 3, 1 in positions 1,3. 
This leads to 

Proposition 5.5. The multiplicity of the eigenvalue a n ~ k on H11 ...1 is c(n, k), the signless Stirling 
number of the first kind. 

Proof. By above observation, the multiplicity of the eigenvalue a n ~ k on Tii i . \ is the number of 
permutations with k record minima, which is also the number of permutations with k cycles, by 
|Sta97l Prop. 1.3.1]. □ 

Example 5.6 (Invariance under reversal). Let w denote the reverse of to, then, for any a, switching 
branches at every node shows that f a (w) = ±/<t(u;) where the parity is n— # Lyndon factors in 
a. Thus, each eigenspace of ^ a is invariant under the map w — > w. For example, /35i42(35241) = 



/35(35241)/i42(35241) = 1 • 1 = — /35142 (14253) when compared with Example 5.4 above. The 
quantity {n—jf Lyndon factors in 35142) is 5 — 2 = 3, hence the change in sign. 

Example 5.7 (Eigenvalue 1). a = n, n — 1, ...,1 is the only word with n Lyndon factors, so 
f a {w) = 1 spans the 1-eigenspace. 

Example 5.8 (Eigenvalue 1/a and descents). There are Q) permutations a which have n — 1 
Lyndon factors. They may be realized by choosing i < j and taking a = n, n — 1, . . . , j + 1, j — 
1, . . . , i + 1, i, j, i— 1, . . . , 1. Then, all but j are record minima and the corresponding eigenfunctions 
are (in the notation at the start of this subsection) , 



fij{ w ) 



1, if ij occurs as a subword of w, 
— 1, if ji occurs as a subword of w, 
0, otherwise. 



Their sum is f{w) := X]?.<j fij( w ) = n — 1 — 2d(w) with d{w) the number of descents in w. This is 



thus an eigenfunction with eigenvalue 1/a, as claimed in Example 1.4 This eigenfunction appears in 
the Markov chain recording the number of descents in successive a-shuffles, which is the same as the 
Markov chain of carries when n numbers are added in base a. The transition matrix of this Markov 
chain is Holte's [Hol97| "amazing matrix." See |DF09bl lDF09a] IDF12j . The theory developed there 
shows that, with s(n, k) the Stirling numbers (x(x — 1) • • • {x — n + 1) = X^>o s ( n ' k)x k ), 

/- „n 7 / \ , n — j) ( n — d(w) — 1\ 

(5-2) m-) = -'E x i^( n l£ J 

fc>o v 7 

is a right eigenfunction with eigenvalue l/W, < j < n — 1, and the eigenfunction / above is -h\. 
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Example 5.9 (Eigenvalue 1/a 2 and peaks). Recall that a permutation w has a peak at position 
i, 1 < i < re, if w(i— 1) < u>(i) > u>(i + l), and a trough at position i, 1 < i < re, if w(i— 1) > u;(i) < 
Call the remaining case a straight: w(i—l) < w(i) < w(i + l) OTw(i—l) > w(i) > w(i+l). 
The number of peaks and the peak set have been intensively investigated [Ste97l WS96j. The 
following development shows that # peaks(u>) — is an eigenfunction with eigenvalue 1/a 2 , as is 
# troughs(u>) — ^p. Indeed, a basis of this eigenspace is f a where a is obtained from re, n — 1, . . . , 1 
by removing j and inserting it after % for i < j, and then removing k > j and inserting it further 
down also. Then, all but j, k are record minima. There are three places to insert k (in the examples, 
i = 1, j = 3,fc = 5): 

1. after / with / / i, l^j,l<k (e.g., 42513); 

2. after j, i.e., a = re, re - 1, . . . , k + 1, k - 1, . . . ,j + l,j - 1, . . . , i + 1, i, j, k, i - 1, . . . , 1 (e.g., 
42135); 

3. after i, i.e., a = re, re — 1, . . . , k + 1, k — 1, . . . , j + 1, j — 1, . . . ,i + k, j, i — 1, . . . , 1 (e.g., 
42153). 

Then f a (w) is, respectively 

1. 1 if ij and fcZ both occur as subwords of w, or if ji and ZA; both occur; —1 if and kl both 
occur, or if ij and /A; both occur; if i is not adjacent to j in w; or if k is not adjacent to I 
(this is fij f ik); 

2. 1 if k or Ar/i occur as subwords of w; — 1 if ikj or j£;i occur; otherwise (this is fijfjk+fikfjk); 

3. 1 if ifcj or j&i occur as subwords of w; —1 if kij or jiA: occur; otherwise (this is —fikfjk + 

fij fik)- 



Proposition 5.10. f/\(w) := # peaks in w — fv(w) := # troughs in w — and f-(w) := 
# straights in w — are right eigenf unctions with eigenvalue 1/a 2 . 



Proof. Let Sj denote the sum of all eigenfunctions of type i, as defined in Example |5.9| above. Note 
that 

s 2 = ^2 fijfj* + fikfjk = # straights - # peaks 

i<j<k 

S3 = ^2 ~ fikfjk + fikfij = # peaks - # troughs. 

i<j<k 

Since each successive triple in u; either forms a straight, a peak or a trough, 

# straights + # peaks + # troughs = re — 2 
Hence /a = § ( s 3 — s 2)) /v = (-S2 + 2S3), /_ = g (2^2 + S3) are all in the l/o 2 -eigenspace. □ 

It may be possible to continue the analysis to patterns of longer length; in particular, it will be 
interesting to see which linear combinations of patterns and constant functions give eigenfunctions. 
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5.3.2 Right eigenfunctions for decks with general composition 



Recall from Proposition |5.l| that, for a composition v = (i>i,V2,--- ,vn) 01 n ) the map ^n^ a 
describes inverse a-shuffling for a deck of composition u, i.e., a deck of n cards where i>i cards have 



value i. Theorem 3.15 applies here to determine a full left eigenbasis (i.e., a right eigenbasis for 
forward shuffles). The special case of v = (n — 1,1) (follow one labeled card) is worked out in 
|Ciu98j and used to bound the expected number of correct guesses in feedback experiments. His 
work shows that the same set of eigenvalues {1, 1/a, 1/a 2 , . . . , l/a n_1 } occur. 

This section shows that this is true for all deck compositions (provided N > 1). It also deter- 
mines a basis of eigenfunctions with eigenvalue 1 /a and construct an eigenfunction which depends 



only on an appropriately defined number of descents, akin to Example 5.8 

The following proposition finds one "easy" eigenfunction for each eigenvalue of l/a k . The ex- 
amples that follow the proof show again that eigenfunctions can correspond to natural observables. 

Proposition 5.11. Fix a composition v of n. The dimension of the 1/ 'a -eigenspace for the a- 
shuffl.es of a deck of composition v is bounded below by the number of Lyndon words in the alphabet 
{1,2,. . . , N} of length k + 1 in which letter i occurs at most V{ times. In particular, l/a k does 
occur as an eigenvalue for each k, < k < n— 1. 



Proof. By remark 2 after Theorem 3.15 the multiplicity of the 1/a -eigenspace is the number of 
monomials in T~L U with n — k Lyndon factors. One way of constructing such monomials is to choose 
a Lyndon word of length k + 1 in which letter i occurs at most V{ times, and leave the remaining 
n — k letters of v as singleton Lyndon factors. The monomial is then obtained by putting these 
factors in decreasing order. This shows the lower bound. 

To see that l/a k is an eigenvalue for all k, it suffices to construct, for each k, a Lyndon word of 
length k + 1 in which letter i occurs at most Vi times. For k > v\, this may be achieved by placing 
the smallest k + 1 values in increasing order. For k < u\, take the word with k Is followed by a 
2. □ 

Example 5.12. For v = (3,2,1,2), the eight eigenfunctions constructed in the last step of the 
proof correspond to the words shown below in order l/a k , < k < 7. The bracketed term is the 
sole non-singleton Lyndon factor: 

44322111, 443(12)11, 4432(112)1, 4432(1112), 443(11122), 44(111223), 4(1112234), (11122344). 



Example 5.13. For an n-card deck of composition v, the second largest eigenvalue is 1/a. Our 
choice of eigenvectors correspond to words with n — 1 Lyndon factors. Each such word must have 
n— 2 singleton Lyndon factors and a Lyndon factor of length 2. Hence the bound in Proposition |5.1l| 
is attained; furthermore it can be explicitly calculated: the Lyndon words of length 2 are precisely 
a lower value followed by a higher value, so the multiplicity of eigenvalue 1/a is (^). This doesn't 
depend on v, only on the number iV of distinct values. 

Summing these eigenfunctions and arguing as in Example |5.8| gives 

Proposition 5.14. For any n-card deck of composition v, let a(w), d(w) be the number of strict 
ascents, descents in w respectively. Then a(w) — d(w) is an eigenfunction of *$> a with eigenvalue 
1/a. 
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Proof. Fix two values i < j. Order the deck in decreasing order, then take a card of value j and 
put it after the first card of value i. In other words, let ij be the only non-singleton Lyndon factor. 
By inspection, the corresponding eigenvector is (up to scaling) 

fij(w) = subwords ij in w} — {# subwords ji in w}; 

summing over 1 < i < j < N shows that ascents in w} — {# descents in w} is an 
eigenfunction with eigenvalue 1/a. □ 

Remarks. Under the uniform distribution, the expectation of a(w) — d(w) is zero. If initially the 
deck is arranged in increasing order w°, a(w°) — d(w°) = N — 1. If w k is the permutation after k 
a-shufnes, the proposition gives E(a(w k ) — d(w k )) = -\(N—1). Thus for a = 2, k = \og 2 (N — 1) + 6 
shuffles suffice to make this expected value 2~ e . On the other hand, consider a deck with n cards 
labeled 1 and n cards labeled 2. If the initial order is w = 11 • • • 12 • • • 21, a(w°) - d(w°) = and 
so E(a(w k ) — d{w k )) = for all k. 

Central limit theorems for the distribution of descents in permutations of multi-sets are devel- 
oped in [CV07j . 



Example 5.15. Specialize Example 5.13 to v = (1, n — 1), so there is one exceptional card of value 
1 in a deck of otherwise identical cards of value 2. Then there is a unique eigenfunction /12 of 
eigenvalue 1/a: 

1, if 1 is the top card, 
f\2{w) = \ — 1, if 1 is the bottom card, 
0, otherwise. 



6 Examples and counter-examples 

This section contains a collection of examples where either the Hopf-square map leads to a Markov 
chain with a reasonable "real world" interpretation — Markov chains on simplicial complexes and 
quantum groups — or the constructions do not work out to give Markov chains — a quotient of the 
symmetric functions algebra, Sweedler's Hopf algebra and the Steenrod algebra. Further examples 
will be developed in depth in future work. 

Example 6.1 (A Markov chain on simplicial complexes). Let X be a finite set and C a simplicial 
complex of subsets of X. Recall that this means that C is a collection of non-empty subsets of 
X such that c E C implies that all non-empty subsets of c are in C. As an example, consider the 
standard triangulation of the torus into 18 triangles: 




Here the top and bottom edges are identified as are the left and right sides. This identifies several 
vertices and edges and X consists of nine distinct vertices. The complex C contains these nine 
vertices, the 24 distinct edges, and the 18 triangles. 
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The set of all simplicial complexes (on all finite sets X) is a basis for a Hopf algebra, under 
disjoint union as product and coproduct 



sex 

with the sum over subsets S C X, and Cs = {A C S : A £ C}. By convention 0^ = 1 in this Hopf 
algebra so S = (f> is allowed in the sum. Graded by \X\, this gives a commutative, cocommutative 
Hopf algebra with basis given by all complexes C; the generators are the connected complexes. 

The associated Markov chain, restricted to complexes on n vertices, is simple to describe: from 
a given complex C, color the vertices red or blue, independently, with probability 1/2. Take the 
disjoint union of the complex induced by the red vertices with the complex induced by the blue 
vertices. As usual, the process terminates at the trivial complex consisting of n isolated vertices. 

This Markov chain is of interest in quantifying the results of a "topological statistics" analysis. 
There, a data set (n points in a metric space) gives rise to a family of complexes C t , < e < oo, 
where the vertices of each C e are the data points, and k points form a simplex if the intersection of 
the e balls around each point is non-empty in the ambient space. For e small, the complex is trivial. 
For e sufficiently large, the complex is the n-simplex. In topological statistics [CCdS06, Car09] one 
studies things like the Betti numbers of C e as a function of e. If these are stable for a range of e 
this indicates interpretable structure in the data. 

Consider now a data set with large n and e fixed. If a random subset of k points is considered 
(a frequent computational ploy) the induced sub-complex essentially has the distribution of the 
"painted red" sub-complex (if the painting is done with probability k/n). Iterating the Markov 
chain corresponds to taking smaller samples. 

If the Markov chain starts out at the n-simplex, every connected subset of the resulting Markov 
chain is a simplex. Thus, at each stage, all of the higher Betti numbers are zero and (3q after k steps 
is 2 fc — Xfc, where is distributed as the number of empty cells if n balls are dropped into 2 k boxes. 
This is a thoroughly studied problem [KSC78J. The distribution of the Betti numbers for more 
interesting starting complexes is a novel, challenging problem. Indeed, consider the triangulation of 
the torus with 2n 2 initial triangles. Coloring the vertices red or blue with probability 1/2, the edges 
with red/red vertices are distributed in the same way as the "open sites" in site percolation on a 
triangular lattice. Computing the Betti number /3q amounts to computing the number of connected 
components in site percolation. In the infinite triangular lattice, it is known that p = 1/2 is the 
critical threshold and at criticality, the chance that the component containing the origin has size 
greater than k falls off as /c -5 / 48 . These and related facts about site percolation are among the 
deepest results in modern probability. See |Gri99| IWer09t ISSG1 1| for background and recent results. 
Iterates of the Markov chain result in site percolation with p below the critical value but estimating 
/?o is still challenging. 

It is natural to study the absorption of this chain started at the initial complex C°. This can 



be studied using the results of Section 3.7 



Proposition 6.2. Let the simplicial complex Markov chain start at the complex C° . Let G° be 
the graph of the 1-skeleton ofC°. Suppose that the chromatic polynomial of G° is po(x). Then the 
probability of absorption after k steps is po(2 k )/2 nk (with n = \X\). 

For example, if C° is the n-simplex, po(x) = x ( x — 1) • • • (x — n + 1) and ^(absorption after k 
steps)= Ili^i (1 — i/2 k ) ~ e~ 2 c if k = 2(log 2 n + c) for n large. If C° is a tree, po(x) = x(x — l) n_1 
and P(absorption after k steps)= (1 — l/2 fc ) n_1 ~ e~ 2 if k = log 2 n c for n large. 
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Using results on the birthday problem in non-standard situations [BH J92 , 
to do similar asymptotics for variables such as the number of /-simplices remaining after k steps 
for more interesting starting C° such as a triangulation of the torus into 2(n — l) 2 triangles. 

As a final remark, note that the simplicial complex Markov chain induces a Markov chain on the 
successive 1-skeletons. The eigenvectors of this Markov chain are beautifully developed in (FislOj. 
These all lift to eigenvectors of the complex chain, so much is known. 

Example 6.3 (Quantized shuffle algebras). It is natural to seek useful deformations of processes 
like riffle shuffling. One route is via the quantized shuffle algebras of |Gre95l [Gre97] and |Ros95| 
IRos97[ IRos98j . These have become a basic object of study |Lec04| IKRllj . Consider the vector 
space k(xi, . . . ,x n ) equipped further with a symmetric Z form X{ ■ Xj. Turn this into an algebra 
with the product of concatentation. Take as coproduct A(xj) = \§t>Xi-\-Xi®l. However, A is to be 
multiplicative with respect to the twisted tensor product {x\ ® X2){yi ® 2/2) = q X2 ' Vl {x\X2 ® 2/13/2)- 
Green translates this into shuffling language. The upshot is if word in 

k(x±, . . . , x n ) then 

mA(w) = Yl q wt{S ' w) w s w s c. 

SC{l,2,...,k] 

Here the sum is over all subsets (including the empty set), wsw s c is the inverse shuffle moving the 
letters in the positions marked by S to the front (keeping them in the same relative order). The 
weight wt(5, w) is the sum of xy • Xj for f £ S c , j £ S, f < j. Thus if w = ijklm and S = {2, 4}, 
we have wswsc = jlikm, and wt(S, w) = i- j + i- l + k-l. 

When q = 1 this shuffle product gives Ree's shuffle algebra and general values of q lead to 
elegant combinatorial formulations of quantum groups. For general q > 0, there is also a naturally 
associated Markov chain. Work on the piece with multi-grading 1™, so each variable appears once 
and we may work with permutations in S n . For a starting permutation tt, and < j < n, let 
°U) = T,\s\=j <l wt{S ' n) ■ Set 9 = Ej=o #(•?)• Choose j with probability 9(J)/9 and then S (with 
\S\ = j) with probability q wt ( S ' w ^ /9(j). Move to ns^gc This defines a Markov transition matrix 
K q (ir,ir') via imA(7r). Note that the normalization 9 depends on it. 

We have not seen our way through this to nice mathematics. There is one case where progress 
can be made: suppose X{ ■ Xj = 1. Then wt(S, it) = inv(5), the minimum number of pairwise 
adjacent transpositions needed to move S to the left. (When n = 5 and S = {2,4}, inv(S) = 3.) 
Since wt(5, tt) doesn't depend on tt, neither does 9 and the Markov chain becomes a random walk 
on S n driven by the measure 




if a has a single descent, 
otherwise, 



where normalizing constant. 

The preceding description gives inverse riffle shuffles. It is straightforward to describe the q- 
analog of forward riffle shuffles by taking inverses. Let [j] q = 1 + q + ■ ■ ■ + g J_1 , [j] q \ = [j] q [j — 
l] q . . .[l] q , and [1\ q = [fc] [j^l-fc] ! ' t ne usua l ^-binomial coefficient. We write I(w) for the number 
of inversions of the permutation w and R(w) = dfa' 1 ) + 1 for the number of rising sequences. 



Proposition 6.4. For q > 0, the q-riffle shuffle measure has the following description on S n : 

fq^/zn, ifR(w)<2, 
I 0, otherwise, 



(6.1) Q g (w) 
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where z n is the normalizing constant ^2 w -r( w \<2 ^ • ^° 9 enera t e w from Q q , cut off j cards with 
probability [jL/^n an d drop cards sequentially according to the following rule: if at some stage 
there are A cards in the left pile and B cards in the right pile, drop the next card 

(6.2) from left with probability — — =- and from right with probability 



[A + B] q ' [A + B], 

Continue, until all cards have been dropped. 



Proof. Equation (6.1) follows from the inverse description because I(w) = I(w ). For the sequen- 



tial description, it is classical that [j] is the generating function for multi-sets containing j ones 
and n — j twos by number of inversion [Sta97[ Sect. 1.7]. For two piles with say, 1,2, ... ,j in the 
left and j + 1, . . . , k in the right, in order, dropping j induces k — j inversions. Multiplying the 
factors in (6.2) results in a permutation with R(w) < 2, with probability q 1 ^ / [ "] . Since the cut 



is made with probability [j] g /z n , the two-stage procedure gives (6.1). □ 



Remarks. When q = 1, this becomes the usual Gilbert-Shannon-Reeds measure described in the 



introduction and in Section 5 In particular, for the sequential version, the cut is j with probability 
(™) /2 n and with A in the left and B in the right, drop from left or right with probability A ^_ B , A ^_ B 
respectively. For general q, as far as we know, there is no closed form for z n . As q — > oo, the 
cutting distribution is peaked at n/2 and most cards are dropped from the left pile. The most 
likely permutation arises from cutting off n/2 cards and placing them at the bottom. As q — > 0, the 
cutting distribution tends to uniform on {0,1, ... ,n} and most cards are dropped from the right 
pile. The most likely permutation is the identity. There is a natural extension to a g-a-shume with 
cards cut into a piles according to the g-multinomial distribution and cards dropped sequentially 
with probability "(/-proportional" to packet size. 

We hope to analyze this Markov chain in future work. See [DROP] for related (/-deformations of 
a familiar random walk. 

Example 6.5 (A quotient of the algebra of symmetric functions) . Consider A = A/ei = k\ei, e^, . . . ], 
with 

n-2 



A (en) = 1 ® e n + Cj ® e n -j +e n ®l. 



This is the Weyl group invariants of type A. It is a polynomial algebra, so the theory in Section 3 



generates an eigenbasis of ^ a and ^* a . One hopes this will induce a rock-breaking process where 



pieces of size one are not allowed; however, we cannot rescale the basis via Theorem 3.4 to obtain a 
transition matrix, as both ei and are primitive basis elements of degree greater than one. Hence 
e\, e\ have the same degree, but toA^) = 8e| and mA(e^) = 4e|, so no rescaling can make the 
sum of the cofficients of m/S.(e^,) equal to that of mA(e|). 

Example 6.6 (Sweedler's example). The four-dimensional algebra 

Ha = k(l,g,x,gx : g 2 = 1, x 2 = 0, xg = -gx) 

becomes a Hopf algebra with A(g) = g® g, A(x) = x <g)l +g®x, e(g) = 1, e(x) = 0. The antipode 
is s(g) = g~ 1 , s(x) = —gx. It is discussed in |Mon93j as an example of a Hopf algebra that is 
neither commutative nor cocommutative. With the given basis {1, g, x, gx}, mA(l) = 1, mA(g) = 
1, mA(x) = x — gx, mA(gx) = —x + gx. The negative coefficients forstall our efforts to find a 
probabilistic interpretation. The element v = x — gx is an eigenvector for mA with eigenvalue 2 
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and high powers of t^tuA applied to a general element a + bg + cx + dgx converge to (c — d)v. Of 
course, this example violates many of our underlying assumptions. It is not graded and is neither 
a polynomial algebra nor a free associative algebra. 

Example 6.7 (The Steenrod algebra). Steenrod squares (and higher powers) are a basic tool 
of algebraic topology [Hat02j. They give rise to a Hopf algebra A2 over F2. Its dual A* 2 is a 
commutative, noncocommutative Hopf algebra over ¥2 with a simple description. As an algebra, 
A\ = ¥2[xx,X2, ■ ■ ■) (polynomial algebra in countably many variables) graded with X{ of degree 
2 l — 1. The coproduct is A(x n ) = Y17=o x n-i ® x i ( x o = !)• Alas, because the coefficients are mod 
2, we have been unable to find a probabilistic interpretation of mA. For example, (^2)3 has basis 
{21,22} and mA{x\) = 0, mA(x2) = x\ so (mA) 2 = 0. Of course, high powers of operators can 
be of interest without positivity |DSC121 IGM09] . 
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